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Chapter 1 

Introduction 



In this thesis we study the so cahed exit problem |34[ Section 4.3] for small 
noise diffusion. This model belongs to the more general area of random 
perturbations of dynamical systems, which has been a very active area of 
research over the last 30 years [13] , |34j , |52j . The small noise diffusion frame- 
work has attracted the interest of both the pure and applied mathematics 
communities. From the mathematical standpoint it is interesting because 
this area has strong interactions with other important branches of mathe- 
matics such as probability theory, dynamical systems, or PDE. As regards 
applied mathematics, the set of problems relating to small noise diffusion 
has found applications in climate modeling jlOj . [llj . electrical engineer- 
ing [16], [65], jt66j, finance |27j, |[31J, neural dynamics [55], [56] among oth- 
ers [23]. The main focus of the thesis, the exit problem, was originally mo- 
tivated by applications on the reaction rate theory of chemical physics |36j . 
Moreover, the work presented here, although purely theoretical, was moti- 
vated by neural dynamics [55] • We proceed to describe the setting, in order 
to provide a more extensive background. 

The setting of this problem is as follows. Given a smooth vector field 
6 : M'^ — )• M*^ consider the Ito equation driven by the d-dimensional standard 
Wiener process W: 

dX,{t) = h{X,{t))dt + ecj{Xe{t))dW{t), (1.1) 
X,(0) = xo. 

Here smooth uniformly non-degenerate matrix valued 

function. That is, the matrix a = a^a is uniformly positive definite. Under 



these assumptions we can ensure that equation (1.1) has a unique strong 



solution (see [41j or ^54j for all stochastic analysis references). 
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Given an initial condition xq S M'^ (or a set of initial conditions), the goal 
is to characterize some asymptotic properties of as e — t- 0. In particular 
we focus on the exit from a domain problem or exit problem for short. 
Consider a domain (open, bounded and connected) D C M.'^ with piecewise 
smooth boundary (at least C^). The exit problem is the study of the time 

rf (x) = inf{t > : X,(t) G dD}, 

at which exits, and the exit distribution Pxai^eirP) G •}. In this work 
we aim for a joint asymptotic result on the distribution of {t^ , X^{t[')) 
under certain assumptions for b. 

As we said before, this problem had its origin in chemical physics: it is 
a glorified model for the speed at which chemical reactions take place. The 
first model of this kind was proposed by Kramers [H], we refer to [H] for a 
modern treatment. 

From a pure mathematics perspective, the problem became of interest 
because it provides a framework to compute asymptotic (as e ^ 0) proper- 
ties for solutions of the Dirichlet problem 

Vue(x) • 6(x) + — Ane(x) = 0, x£D, (1.2) 

Ue{x) = g{x), X G dD. (1.3) 

Indeed, using a relaxed version of the Feynman-Kac theorem |41l Theorem 
4.4.2], the solution to this PDE can be written as the average lie(x) = 
Ea;g'(Xe(r/')), where is the solution of (1.1) with a = Id. Solutions to 
other PDE's can be written as a similar average, but for this discussion we 
choose (1.2) since it is representative of the area. Although the asymptotic 
study of the function is now known to be strongly related to the solution 
to the exit problem, the first studies relied only on analytic non-probabilistic 
arguments. For example, in [25], [39], [IQ] under some assumptions on the 
drift 5, they were able to rigorously write the solution as a formal series in 
e. In [60j it is proved, under the assumption that 6(0) = and that D is 
contained in the basin of attraction of 0, that as e — )• 0, converges to a 
constant. This result is of great importance since it means that the system 
forgets its initial condition. However, this is not always true. For instance 
in [2j using functional analysis and dynamical systems for the Levinson case 
(see section 1.1) it is proved that the limiting function at x depends on the 
orbit of X under the action of the flow generated by b. In [43J and [44J 
a combined approach involving probabilistic and purely analytic arguments 
was put into practice. Very general limit theorems were obtained, but strong 
restrictions on the non-linearity of the drift b were imposed. 
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The standard mindset in tackling this problem from the probabilistic 
point of view is to think of the SDE that defines as a (random) sin- 
gular perturbation of the system x = b{x). In this context it is natural 
to expect that the methods used to study the exit problem lie in the in- 
tersection between probability theory and dynamical systems. Freidlin and 
Wentzell [31], were the ones who put together a general theory in this 
direction. They based their theory on the Large Deviation principle for X^. 
This result was then used as the building block in constructing what today 
is known as the Freidlin- Wentzell theory. The core of this theory strongly 
relates the exit behavior of to the properties of the vector field b by 
providing two elements: 

1. It defines a function V : D x dD — )• [0,oo], known as quasi-potential, 
that characterizes the exit distribution. Indeed, the exit distribution 
of is asymptotically concentrated on the set ^^-^ of minimizers of 
V{Xe{0), •). Moreover, under the assumption that a is uniformly non- 
degenerate, V, and hence Vx.(o)^ depends mostly on b. For example it 
can be shown that in the case of 6 = V0, V is proportional to (p. This 
is the reason why the V function is called quasi-potential. 

2. It shows that in the case in which the domain is contained in the basin 
of attraction of an equilibrium , log tP converges in probability to 
the minimum of V{X^{0), •). This result is of vital importance since it 
gives a hierarchy of transition for the case in which there are several 
equilibria. See [34^ Section 6.5] for more background on this particular 
direction. 

The theory came to light with a series of papers beginning with [62j and [63] 
until the Russian edition of the book |i34j appeared. See |32i and [33j for 
a modern version of the theory, and [T7], [19] and references therein for a 
stochastic partial differential equations version of the theory. In Section |1.1| 
we give a brief review of the Freidlin- Wentzell theory necessary to under- 
stand the motivation of this work. 

In contrast with the Freidlin- Wentzell theory, that mostly relies on the 
large deviation principle, a modern trend relying on a path-wise approach 
has emerged in the last years. As a consequence, more detailed phenomena 
can be captured. That is the case, for example, in [4j in which a heteroclinic 
network is considered or in |12j in which a bifurcation problem is studied. 
The monograph |13| contains several examples in this direction together 
with applications. 
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In this thesis, a modern and more complete treatment of two cases is de- 
veloped: the saddle case in which the vector field has a unique saddle point 
and the Levinson case in which the deterministic dynamics escape from the 
domain in a finite time. The two results when combined complete the treat- 
ment of the case in which the underlying dynamics admit a heteroclinic 
network as studied in [4J. We also provide a 1-dimensional example that ex- 
plains how to obtain a correction to the exit time for a diffusion conditioned 
to exit through an unlikely exit point. The approach presented here relies 
heavily on the underlying dynamical structure, and combines techniques 
from differential equations, bifurcation theory and martingale theory. 

The rest of this chapter is organized as follows. In Section |1.1| we give 
a brief introduction to Freidlin-Wentzell theory. In Section |1.2| we study 
the exit problem in the case the system x = b{x) has a saddle point. In 



Section 1.3 we study the escape when it takes a finite time in the so called 
Levinson setting. Applications of these two cases are presented in Sec- 
tions |1.4.1 and 1.4.2[ The general setting and a brief description of the 



chapters is given in Section 1.5 



1.1 Background and Motivation 

In this section we gather the background tools that will allow us to explain 
where our results stand with respect to the classical Freidlin-Wentzell theory. 



Consider , the strong solution to the SDE (1.1). If seen as a random 
perturbation, the equation for suggests that the process should behave 
like the flow generated by b: 

j^S^x = b{S^x), S^x = X. (1.4) 

Indeed, through a standard martingale argument, it is easy to see that for 
any 6 > there are constants C^"^^ and d^'g such that 

sup \X,(t) - >s \ < c!^le-^Ty\ (1.5) 

t<T J 

This inequality can be used to show that P|, the law of X, on C([0, T]; M"') 
conditioned to X^{0) = x, converges weakly (on the space C{[0,T];U.'^)) to 
the measure concentrated on the orbit of x. See |15j for a series expansion 
in e and [18] for a series expansion of more general stochastic flows. 




The question now is to find the optimal constant in (1.5), or more 



generally to find a large deviation principle [23], [23] or Appendix [A] 
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Theorem 1 (Preidlin- Wentzell [34j ) Let Hq j, be the space of all abso- 
lutely continuous functions from [0, T] to with square integrable deriva- 
tives. Define the functional I^ by 

= \ j\v is) - b{^{s)),a-H^{s)){V> is) - bi^isMds, (1.6) 

if if £ H^rp and 9^(0) = x, and oo otherwise. Here b is the drift in (1.1) and 
a = a'^a, with a the diffusion matrix in (1.1). 

Then for each x S M*^ and T > the family (P|)e>o satisfies a Large 
Deviation Principle on C([0,T];M'^) equipped with uniform norm at rate 
with good rate function I^. 

See [9j and [51J for more large deviations results. In order to have this 
thesis as self contained as possible, we give a large deviation overview on 
Appendix . 

Informally, Theorem [l] says that if A C C{[0,T];R'^) then 



{X, G yl} X e-' 



Intuitively, due to (1.6), this result suggest that If. serves as a measure on 
how costly (in terms of probability) is for the system not to follow the 
deterministic trajectory. This interpretation is essential when solving prob- 
lems that require non-compact time frames, in particular, when studying 
the exit problem described above. 

Regarding I^ as a cost function, it make sense to introduce 

Vix, y) = mf {lUv) : ^{T) = y, ^{[0, T]) C D U dD} (1.7) 

as the cost to go from x to y inside D. The function V : D x dD — t- [0, oo], 
known as the quasipotential, plays an important role on the exit problem 
we described: 

Theorem 2 (Preidlin- Wentzell [34| ) Suppose X^{0) = xq and let 

y€aD 

Then for every closed set N C dD such that inf^gAr V{xo,y) > z, 

limP,jX,(r,) GiV} = 0. 
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Figure 1.1: Example with several minimizers 



The necessary observation derived from this theorem is that, in the limit, 
the exit occurs in a neighborhood of the set of minimizers of the quasipo- 
tential. The limitation is that when there are several minimizers, the result 
doesn't provide any distinction between them. For example, suppose the 
phase portrait of S is as in Figure |1.1| and D is a rectangular region as in 
Figure [LT] Then the set of minimizers consists of 3 points: qi, q2 and q^. 
Freidlin-Wentzell theory ensures that, asymptotically, the exit occurs out- 
side this set with exponentially small probability, but doesn't distinguish 
between the minimizer. For example, the theory is not able to establish if 
it is more likely to exit on a neighborhood of gi or in a neighborhood of q2- 
Bakhtin [3j started a theory that will allow us to compute the probability 
of exiting close to each of the minimizers. To complete this theory is part 
of the motivation for this work. 

This work (see also ^j) drops some technical assumptions needed in [4j 
in the planar case. The program makes intensive use of normal form theory 
and provides small noise estimates for non-linear diffusion. With this work, 
the asymmetric behavior found in is extended to arbitrary Hamiltonian 
systems on the plane. Moreover, the present text also (see p]) provides exact 
scaling corrections when the flow S exits the domain in a finite time. This is 
a step towards to both Freidlin-Wentzell theory and Bakhtin's heteroclinic 
result, since extension to the case in which S is asymptotically stable can 
be carried out by time reversing. In this direction, a 1 dimensional example 
is presented in Chapter |3j This is an open and promising future research 
area, since it may provide scaling limits for exit points under very general 
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assumptions on b. 

1.2 Escape from a Saddle 

In this section, we assume that the system has a unique critical point and 
that point is a saddle. Without loss of generality, suppose that the critical 
point is the origin; that is, we are assuming that G M*^ is the only point 
X € D L) dD such that b{x) = 0, and the matrix A = V6(0) has spectrum 
bounded away from zero with at least one eigenvalue with positive real part 
and one eigenvalue with negative real part. In other words, there is a pair 
of integers i', fi > 1 such that the eigenvalues Ai, A^^ of ^ satisfy 

ReAi = ... = ReA,, > ReA,,+i > ... > ReA^ > > ReA^+i > ... > ReA^. 

Under this assumptions, it is well known [64:\ Theorem 3.2.1] that D = 
DUdD can be decomposed as D = {0} UWUWUW, where 



W" = {x G D : lim S^x = 0, and for some s > 0, S'-'^^'^^x C D and S^''°°^x nD = 
s>— oo 

W^ = {xeD: c D and S^'^'^^^'x n L> = 0, si < and > 0}, 

and, 

= {x£ D : lim S^x = 0, and for some s < 0, S^-'^''^xnD = and S'^'-'^^x C D} 

Here, for an interval A C M, S^x denotes the set 

= {5*x :teA}. 

We are ready to state the theorem in [44j concerning the exit time r^: 
Theorem 3 If x e D , then 

Tf p 1 

;^0. 



log e Re Ai 
Consider the (deterministic) time 

T{x) = inf{t >Q:S^xe dD}, 
then, ifxe {W^UW)nD, 

Te ^ T{x), e 0. 
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In order to state the corresponding theorem for the exit distribution, 
denote Fmax the generahzed eigenspace of A which corresponds to Ai, A,y. 
The Hadamard-Perron theorem \53\ Section 2.7], |64| Theorem 3.2.1] states 
that there is a z^-dimensional 5*-invariant submanifold Wmax tangent to 
Pmax at the origin. Note that the intersection Qmax = W^max H dD is not 
empty. Moreover, in the case of > 1, Qmax is a — 1-dimensional manifold, 
while for 1/ = 1 consists of two points: Qmax = {Q-,Q+}- The result in [43] 
reads: 

Theorem 4 If x & D D and v > 1, then for any relatively open subset 
G C dD such that Qmax C G, it holds that 

limP,.{X,(r,) eG} = 1. 

If V = 1 then the measure Pf(-) = V{X^{Te) G •|Xe(0) = x} converges 
weakly to the measure \5q_ + \5qj^, where 6z is the probability measure con- 
centrated at z. 

If X £ {W^ U W") n D, Pf (•) converges weakly to the measure 6gT{x)^. 

In the work by Day [,22] a refinement to the theorem about the exit 
time is given in the 2— dimensional situation. He proved that Air^ can be 
written as a sum between — log e and a tight correction. He gave a precise 
asymptotic description for the distribution of the correction: 

Theorem 5 If d = 2 and X,{0) eWnD, then 

Airf + loge^/C + C7^ 

in distribution. Here KL and Gi, are independent random variables. More- 
over, K, has a density with respect to Lebesgue measure given by 

/vr 



and Cy is a Bernoulli random variable with P{Cjy = G±} = 1/2, where C± 
are a constants depending only on b and a. 



This theorem complements previous work by Mikami [3^ which estab- 
lished the decay in the distribution of —i^t^^- 

Theorem 6 For an arbitrary d> 1 and X^{0) e W D D for T £ (0, 1) it 
holds that 



lini-^logf-logp|--^rf <T[> ) = T - 1, 
e-s>o log e \ [ log e 
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while for T > 1, 

lim -^P |-7^rf > t| = (T - l)/2. 
e^o loge [ loge J 

Theorem [6] is a good refinement of Theorem |3j but it doesn't prove that 
the distribution of the difference Xit^ + log e is tight as in Theorem [5] The 
tails established in Theorem [6] are consistent with the tails of Theorem [H 

Bakhtin [5] gave a refinement of the last theorems. He proved that in 
any dimension the family of random variables + A^^loge converges in 
distribution and he identified the limit. He also improved the result about 
the convergence on the exit distribution in Theorem |4j In the case 1^ = 1, 
he showed that the factor 1/2 in the limiting distribution ^6q^ + ^Sq_ comes 
from the symmetry of a certain Gaussian random variable. 

Theorem 7 In the case u = 1, parametrize the manifold WVax as a C^- 
curve 7 that can he locally represented as a graph over the (one dimensional) 
space Tmax- Let the times t{±6,q±) be the time it takes to the deterministic 
flow to go from 7(±5) to q±: 

t{±6,q±)=T{j{±6)), 

with T{x) defined as in Theorem^ 
Then, the numbers 

/i± = lim (^^+t{±5,q±)] 
<5->0 \ Ai / 

are well defined and, in the case where a = I , and x £ D Ci , there is a 
Gaussian random variable M and a number k = k{x) > such that : 

1. e — )• the convergence 
and 

1 P 1 

+ — loge — > /i+l|^>o} + /i_l{Ar<o} - T- log(KAA) 

hold in probability. 
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2. As e — )• the distribution of the random vector {X^{t^),t^ + j^loge) 
converges weakly to the measure 

where fJ'h±,uj is the distribution of 

h± - -^log(KAA). 

The proof of Theorems [5] and [7] is based on the study of the hnear system 
and then an approximation to the non-hnear one. The steps followed for a 
prototypical 2— dimensional system is presented in the next section. Part of 
this thesis focus is on adapting this methodology to the non-linear case. 

1.2.1 A 2-dimensional linear example 

For a fixed 6 > 0, consider the domain D = (—5,5) x (—6,5) C M?. Given 
two positive numbers X± > 0, we sketch the solution to the exit problem 
from D for the diffusion = {xl,x'^) given by 

dX,{t) = diag(A+, -X-)X,{t)dt + edW{t), 
X,iO) = (0,xo) G D. 

Here, for a column vector v = (fi,U2), vi is the first coordinate and V2 is 
the second one. 

Using Ito's formula [41 ^ Theorem 3.3.3] in each coordinate we write the 
Duhamel principle for xl and 

xl{t) = ee^+* [\-^+'dWi{s), (1.8) 
Jo 

xl{t) = e-^-'xQ + e f e-^-(*-^)dW2(s). (1.9) 
Jo 

These two identities are the main ingredient in this development. 

We will show that exits D along (5,0) or (—(5,0). Consider the time 
at which x\ exits the interval {—5,5): 

rf = inf{t > : \x\{t)\ = 5}. 

The program is to compute based on the path- wise properties of X^. Use 
the identity obtained for to characterize X^{t^) from which we will deduce 
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that P{rf = T^} 1 as e 0. Hence, we can obtain the limiting behavior 
of {T^,Xf) based on the identities we have obtained for (t^ ,Xe{T^)). This 
win estabhsh a result in the spirit of Theorem [7] 

First, note that < oo with probabihty 1. This is a classical fact that 
we prove in Appendix [b| for completeness. On the set {rf < oo}, define the 
random variable Me by 



'dWi{s). 



An application of Duhamel's principle (1.8) for xl and the definition of 
establishes the equality 6 = ee^+'^^ |A4| with probability 1. This identity 
implies that, with probability 1, 



1 1 1 1 

— loge + — log 



_6_ 



Using equality (1.10) together with (1.8) and (1.9) it holds that 



and 



x\{T!) = 6sgn{K), 



+ e 



^-(t-^)dVF'2(s), 



:i.io) 



(1.11) 



:i.i2) 



both with probability 1. Hence, if we can establish tightness for the distri- 
bution of the family of random variables (A/'e)e>0) the fact that P{t^^ = t^} 



converges to 1 would be a consequence of (1.11), (1.12) and the tightness of 



the distribution of the stochastic integral in (1.12 



In order to get the tightness result, we need to analyze the time with- 
out any reference to (1.10). We can show two properties (see Appendix [B] 
for their proofs): 

1. For every 5 > 0, — )■ oo in probability as e — )■ 0. 

2. As a consequence to the last point, as e — )• 0, 



A4 



/•oo 

Jo 



Let Af be the limit Gaussian random variable of {Me)e>o in the second ob- 
servation above. Then, we have proved the following lemma: 
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Lemma 8 



A, 



log 



6 

W\ 



0. 



We apply this lemma to the exit distribution of X^. Before that, let us 
denote J\f- a zero mean Gaussian random variable with variance (2A_)^^ 
independent of A/". It is possible to prove that 



0, 



in distribution. This convergence combined with the convergence of and 
Afe used in (1.11) and (1.12) implies that on the set {t^ = t^}, 



X,(rf ) = 5(sgnA4,0) + e(^-/^+)^i(0,6)- 
Here {^e)e>o is a family of random variables that satisfies 



a;ol{A_<A+} +A/'-l{A_>A+}, e-s-O, 



(1.13) 



:i.l4) 



in distribution. Moreover, it can be shown that when A_ < A+ this conver- 
gence holds in probability. 

Hence Theorem [7] holds with j{t) = (t, 0) up to time re-parametrization. 

1.2.2 Analysis and generalization 

The simplified argument of last section not only recovers, for this simple 
linear case. Theorem [7] but also provides more information. From (1.12) we 
can see that when A_ < A-(_ the exit distribution has a bias in the stable 
direction. In this case, the second coordinate of the exit distribution is not 
centered. Indeed, from (1.13) and (1.14) we can identify 3 cases: 

1. When A- > A+, the second coordinate of converges to J\f-. Hence, 
the exit has a centered distribution in the stable direction. We refer 



to this case as the symmetric case and is illustrated in Figure 1.2 



2. When A_ = A_|_, the second coordinate of X^ converges to ^-j- 

M-. Hence, the exit has a bias in the stable direction due to the initial 
condition xq. We refer to this case as the asymmetric case. 



2^0 + 
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Figure 1.2: Symmetric Case. 



M 



A-/A-I 



3. When A_ < A+, the second coordinate of converges to j xq. 
Hence, the exit has a very strong bias in the stable direction due to the 
initial condition xq- We refer to this the strongly asymmetric 



case and is illustrated in Figure 1.3 



The consequences of such an asymmetry, when there is one, have been ex- 
plored in |3] , and it turned out to be an important improvement to Freidlin- 
Wentzell theory for a particular situation. We will summarize this improve- 
ment in Section 1.4.2 For now let us comment about how general the 



argument of last Section 1.2.1 really is 



The immediate limitation of the argument presented in Section 1.2.1 



IS 



that it is mostly based on explicit representations for the solution of X^. 
In [5], [1] and [22] a linear approximation to the original process is made. 
The non-optimal feature of this procedure is that we lose all the identities, 
and we have just approximations. This is not acceptable if we are interested 
on computing the properties of the aforementioned asymmetry. 

A similar but structurally different argument is presented here in Chap- 
ter [2] ( see |lj also) in which a change of variable is introduced to linearize the 
system locally. Consider the process Y^{t) = f{X^{t)) and for the moment 
assume / to be smooth. Then, the new process solves the SDE: 



dV, = (Vr i(y,)) ' b{f-\Y,))dt + eaiY,)dW + e^^iY,)dt, 
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Figure 1.3: Strongly Asymmetric Case. 



for some smooth (depending on the function a) functions ^ : 



a 



. If we can choose the function / to be such that 

{yr\y)y'b{r\y)) = Ay, 



and 



:i.i5) 



then the proof of the theorem wih fohow almost identically as the case in 



Section 1.2.1 (only with multiplicative noise instead of additive one) giving 



the asymmetry for the general case. The existence of such a transforma- 
tion belongs to the study of conjugation in Ordinary Differential Equations. 
The main result of the latter theory (the so called Hartman-Grobman The- 
orem [371 Theorem IX.7.1], Section 2.8]) guarantees the existence of a 



homomorphism / that solves (1.15). This is not enough for our purposes 



since we need / to be in order to use Ito's formula. In 0] is assumed 
that the transformation / exist and is C^. In order to become aware on 
how restrictive (if restrictive at all) is this hypothesis, we need to study 
transformations / that satisfy a relaxed (in a sense explained below) version 



of (1.15). Such transformations are the main subject of study in normal 
form Theory [20], |l2], [53], [M]. 

We will summarize the main ideas in normal form theory. Following 
we call a set of complex numbers A = (Ai, A2, A^) non-resonant if 



there are no integral relations between them of the form Aj = a • A, where 
a = («!,..., CKrf) G Z^J_ is a multi-index with \a\ = 0:1 + .. .+0:^ > 2. Otherwise, 
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we say that it is resonant. Moreover, a resonant A is said to be one-resonant 
if all the resonance relations for A follow from a single resonance relation. 
A monomial called a resonant monomial of order R 

if a • A = Aj and \a\ = R. Normal form theory asserts (see [50], [5Sj) that 
for any pair of integers i? > 1 and A; > 1, there are two neighborhoods of 
the origin Qj and fig and a C'^-diffeomorphism f : ^ Q,g with inverse 
g : VLg ^ Qf such that 

{Vr\y)Y^Kr\y)) = Ay + P{y)+n[y), yGQg (1.16) 

where P is a polynomial containing only resonant monomials of order at 
most R and TZ{C) = 0(|C|^^^)- If ^ is non-resonant, then / can be chosen 
so that both P and TZ in ( |1.16| ) are identically zero. Moreover, due to [381 
Theorem 3, Section 2], if A is one-resonant then / can be chosen so that TZ 



in (1.16) is identically zero. 

The result in [4] only includes the non-resonant case. When applied to 
heteroclinic networks (network of saddles interconnected to each others) this 
assumption impose a restriction by requiring each critical point to be non- 
resonant. In particular, typical Hamiltonian systems (that usually present 
heteroclinic structures) have resonant relations due to the symplectic struc- 
ture [20]. 

In this work, we give a complete solution to the 2— dimensional case with 
no assumptions about resonance, and with random initial conditions. The 
theorem (see [Ij also) informally reads as 



Theorem 9 Let be the solution of equation (1.1) with initial condition 
given by 

X,(0) =xo + e%, 

where a £ (0, 1) and (^e)e>o is a family of random vectors that converges 
weakly to the random vector ^o- We assume that xq G W and is such 
that 

P{b{xomo} = 0, 

where \ \ means collinearity of vectors. 

Then, there is a family of random vectors {(pe)e>o, o family of random 
variables ('ipe)t>o, and the number 



1, aA_ > A_ 



a^, aX- < A+ 



such that 



^e{Te) = Qsgn{iP,) + e^<; 
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The random vector 

converges in distribution as e ^ to the limit Qq that can be identified. 
The exit distribution exhibits the behavior presented at the beginning of Sec- 
tionlMM 

The proof of this theorem is divided in two steps. One, is the study of 
the diffusion when is close to the origin. The second, is the study of the 
diffusion X^^ far from the origin. Here, the meaning of close and far relies on 
whether or not the system can be conjugated to its Normal Form. 

By the study of close to the critical point, we mean X^ £ B, where B 
is a neighborhood of the origin in which normal form conjugation is valid. 
In this case, the analysis has two parts. The first part is when the diffusion 
starts along the stable direction. To study this part, we study the diffusion 
until the the projection of X^ along the stable direction dominates over the 
noise level. We achieve this by posing the problem as an exit problem from 
the strip [— e°, e"] x B, for some a £ (0, a). For the second part we study the 
exit problem from B with the initial condition being the exit distribution 
obtained in the first part. In order for this program to be successful, we 
require very precise path-wise expressions for the diffusion. By using the 
general form of the resonances we are able to preserve the essence of the 



argument in Section 1.2.1 



The analysis of the system far from the origin is used twice. The first 
time is when the system starts along the stable manifold W. The second 
time is when is about to exit the domain and it probably has some bias. 
Our study is based on a series expansion in powers of e > 0. This expansion 
is inspired by [15i , but it requires additional geometric arguments. These 
results are of independent interest, so we start a new section to describe 
them. 



1.3 Levinson Case. 

This section is devoted to the Levinson case. We will first review the history 
of the problem and then outline our contribution. 

Given an initial condition xq £ D, Levinson condition is a hypothesis 
associated to the fiow S and the domain D. This case was originally for- 
mulated in with a PDF fiavor, we state the condition as presented by 
Freidlin ^ Chapter 2]. 
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Condition 10 (Levinsion) The flow S satisfies the Levinson condition at 
xq £ D with respect to D if the following holds: 

1. The exit time 

T{xo) = inf{t > : S^xq € dD}, 

is finite. 

2. The flow S^xq leaves the domain immediately after T[x). That is, 
there is a 6 > such that 5^(^o)+' ^DUdD for all s G (0, 6). 

We say that the domain D cMJ^ satisfies Levinson condition if properties 1 
and 2 are satisfied at each x £ D. 

As mentioned in tlie introduction, Levinson 06], [17] was originally inter- 
ested in studying the behavior of the solution of the PDE: 

Vue(x) • b{x) + —Au^{x) =0, X € D, 

Ueix) = g{x), X G dD. (1.18) 

The claim is that the solution to this PDE has to converge to the solution 
of the unperturbed PDE: 



Theorem 11 ( [46j ) Under the Levinson condition 10, there is a unique 
(maybe generalized) solution to both, the perturbed problem (1.18) and the 
unperturbed problem 

Vuo(x) • b{x) = 0, X e D, 

uo{x) = g{x), X £ dD. 

Let uq : ^ M'^ be a solution to the unperturbed problem. Then, if g : 
dD R'^ is smooth, 

Ue{x) = Uo{x) + eVe{x), 

where Ve{x) is a locally bounded function for each e > 0. 

The proof of this theorem is based on a series expansion of along the 
characteristics of (1.18). In order to prove the result with this idea, several 
technical challenges had to be overcome in [46j . Contrastingly, as pointed out 
in [35], the probabilistic approach here is simpler and cleaner. Indeed, once 



we know inequality (1.5), the convergence in Theorem 11 is an immediate 
consequence of the stochastic representation Ue{x) = ExgiX^{TP)), where 
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solves (1.1) with a = Id. To get the exact behavior a series expansion of 
the processes has to be made as in [15] and [311 Chapter 2]. 

In this thesis we develop a path- wise approach to this problem. We give 
a geometrical characterization of the exit point X^{t^), and joint properties 
of {X^{t^),t^) are obtained. We start by obtaining a generalization of the 
series expansion given in [TJ]. This result serves as backbone of our proof 
and has independent interest on itself. 

In order to present our result, we need further notation. Write b as 

6(x) = b{y) + Vb{y){x - y) + Qi{y,x - y), x,y e M"', 

where 

\Qi{u,v)\ < K\vf 

for some constant K > and any u,v ^ M*^. 

Denote by $a;(^) the linearization of S along the orbit of x: 

^^^^{t) = A{t)^^{t), $.(0) = /, (1.19) 
at 

where A{t) = V6(5*x) and / is the identity matrix. We can state our first 
lemma: 

Lemma 12 Consider the initial value problem 

dX,{t) = {b{X,{t)) + e°i^'e(X,(t))) dt + ea{X,{t))dW (1.20) 
X,(0) =xo + e"'e., e>0. (1.21) 

where, for each e, ^'^ is a deterministic Lipschitz vector field on M.'^ converg- 
ing uniformly to a limiting Lipschitz vector field ^q. Both ol\ and 02 are 
positive scaling exponents. The family of random variables {£,e)e>o converges, 
as before, to in distribution as e — )• 0. 
Let 

^^(t) = e"2-"<I>,,(t)ee + e"^"°^xo(i) r ^xo(5)"'^o(5'xo)ds 

JO 

+ (t) / ^.,,{s)-^a{S'xQ)dW{s), 

Jo 



and 



^xo(t) / $xo(s)-'^'o(5^x)ds 
Jo 

+ l{i=„}<I>:.o(i) / '^-l{s)a{S'xo)dW{s), t > 0. 
Jo 
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Then, 

holds almost surely for every t > 0, where ^Peit) = (t>e{t) + fe{t), cind r^ con- 
verges to uniformly over compact time intervals in probability. Moreover, 
for any T > 0, tp^ ^ (po, in distribution in C[0,T] equipped with uniform 
norm. 



The reason to consider (1.20) instead of the more standard (1.1) wih 



become evident in Section 1.4.1 For now, let us observe that, although 



inequality 1.5 may not hold, it is still true that 



sup \ X^{t) 

t<T 



0, 



0. 



Hence, under the Levinson Condition 10 the exit of from D will occur 
on a finite (still random) time and very close to the deterministic exit z = 
5^(3:^0 better understanding of this convergence is the main result in 

this section. 

The main theorem provides a scaling limit to the distribution of (r^, Xe(re)). 
In order to understand the theorem, we regard the boundary of D as an hy- 
persurface M in M*^. In general, for an hypersurface M in W^, denote the 
tangent space of M at the point z £ M as T^M. Further, we denote the 
(algebraic) projection onto span(6(z)) as tt;, : M'^ — )• M , and the (geometric) 
projection onto T^M along span(6(z)) as ttm '■ — ^ T^M. In other words, 
for any vector v £ M"', 7ri,v G M and ttmv G TzM are the unique number and 
vector such that 

V = TTbV ■ b{z) + ttmv. (1-22) 
With this notation in mind, we are ready to state the theorem: 

Theorem 13 Let M be an hypersurface in M*^. Let X^ be the solution 
of (1.20) with initial condition (1.21). Consider Te andT{xo) the exit time 
from M of X^ and S respectively, //a = ai A a2 A 1 and z = S^^^'^'^xq, then 



T.XAt, 



z) (-7rfe0o(r),7rM(Ao(r)), 



(1.23) 



in distribution. Here VTb and ttm o.re as in (1.22). 



1.4 Applications 

1.4.1 Conditioned diffusions in 1 dimension 

Throughout this section, we restrict ourselves to the 1-dimensional situation. 
In particular, let, for each e > 0, be a weak solution of the following (1 
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dimensional) SDE: 

dX,{t) = h{X,{t))dt + ea{X,{t))dW{t), 

where h and a are C"^ functions on M, such that h{x) < and (t{x) ^ for 
all X in an interval [oi, 02] containing xq. We want to study the exit of such 
an interval D = [01,02], that is 

= inf{t > : Xf^{t) = ai or 02}. 

Let = {X^ije) = 02}, and note that, since 6 < 0, lime_s>o P(-Be) = 0. More 
precise estimates on the asymptotic behavior of P(-Be) can be obtained in 
terms of large deviations. However, our interest is to study the process 
conditioned on the rare event i?^- 

In this case T{xq), the time it takes for the flow S generated by —h 
starting at xq to reach 02, is given by 



2 2/ \ he{x) 



It is known from our basic Lemma 12 that — )• T(xo) as e — )• in proba- 
bility. But the correction was not known so far. 

The idea is to condition the diffusion to the event and note that this 
conditioned process solves a martingale problem (hence is a diffusion) and 
the result from Section 1.3 are applicable. Hence, we have the lemma: 

Lemma 14 Conditioned on B^, the process is a diffusion with the same 
diffusion coefficient as the unconditioned process, and with the drift coeffi- 
cient given by 

6e(a;) = 6(2;) +eV^(x)- X , / x , , 
Jai hMdy 

where 

With this lemma and the help of an analogy of Laplace's method the 
main theorem in this direction is: 

Theorem 15 Conditioned on B^, the distribution of e~^{Te — T(xo)) con- 
verges weakly to a centered Caussian distribution with variance 

aHv). 



CHAPTER 1. INTRODUCTION 



24 



The result is of relevance not only because of the correction itself, but 
also, because is the first step of analysis for diffusions conditioned on rare 
events. Such a tool may lead to a general theory of correction in small noise 
systems. 

1.4.2 Planar Heteroclinic Networks 

A further application of our results is to the theory of Noisy Heteroclinic 
Networks first proposed in [3]. Our presentation applies only to the 2- 
dimensional situation. See for a survey in this direction. 



In Section 1.4.2 we give an intuitive presentation of the argument in |3]. 



The general theory developed in [1] is presented in Section 1.4.2 In subsec 



tion 1.4.2 relations of this result to the current text are highlighted. In this 



section, we also introduce the idea of a random Poincare map. 



Intuitive argument 



We study the exit problem of the diffusion ( 1.1 ) from a domain D. Consider 
the vector field 6 : — )■ which has finite set of critical points Z = {Ck)k=i 
inside D. We assume that S admits an heteroclinic network. 

A heteroclinic network for the flow S is an invariant that contains at 
most countable number of saddles connected with each other. For simplicity, 
in this section we suppose that S admits an heteroclinic network with a 
finite set of critical points Z = {Ck)k=i inside D. Precisely, we assume the 
following: 

1. Each critical point Ck is a saddle point of the flow S. That is, 6(Cfc) = 
and the matrix Af^ = V6(Cfc) has two eigenvalues: > and — < 
0. 

2. The flow S generated by b admits an heteroclinic structure in D. We 
give the technical description of this assumption. For each critical 
point Zk € Z, let be the 1-dimensional locally stable manifold 
and the 1-dimensional locally unstable manifold. Take a 6 > 
small enough so that normal form conjugation and Hadamard-Perron 
invariant manifold theorem holds for a ball Bk = Bs{Ck) of radius 5 > 
centered at each critical point . Denote {q^,q^} = Ci Bs{Ck)- The 
hypothesis that b admits an heteroclinic structure means that for each 
integer 1 < A; < A^, there is an integer S {1, N} such that 

hm S'q^=Cn±. 
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3. All non degeneracy assumption made in Section 1.2 hold for each crit- 
ical point. 

Suppose the starting point for the diffusion X^{0) is a deterministic point 
in Wf . Theorem [9] implies that with high probability the diffusion will 
exit Bi approximately along qf or gj~ with equal probability. Moreover, 
Theorem [9] tells us how to compute the scaling exponent of the additive exit 
correction term and the asymptotic distribution of this correction term. 

For this discussion, we suppose that the diffusion exits Bi asymptot- 
ically close to qf. The exit from Bi will now be the initial condition in 



Lemma 12, Applying this lemma (with \I'e = 0) for sufficiently large T, 
we can derive the asymptotic representation of the entrance distribution 
for B +, which satisfies the properties imposed to the initial condition in 



Theorem^ Observe that Lemma 12 also implies that the scaling exponent 



of the additive correction in the entrance distribution for B + is the same 
as in the exit distribution for Bi. Moreover, this lemma implies that the 
asymptotic distribution of the additive correction term has in the entrance 
to B^+ is the evolved (under the linearization of S) version of the asymptotic 
distribution of the correction term at the exit from Bi. In particular, any 
bias on the exit of Bi gets translated, by the linearization of the flow, to 
the entrance of B^+. Let ii = nf. Then Theorem joj applies again to derive 
that asymptotically the exit distribution from B^+ is concentrated mostly 
along q-+ or q-- , but with possible unequal probability. We can proceed like 
this iteratively along any sequence of saddle points zi, Zj^, such that 
for any j, ij+i = n^{ij) or ij+i = n~{ij). 

The result of this procedure allows us to conclude that the system evolves 
in a Markov fashion (choosing the next saddle with probability 1/2 indepen- 
dently of the history of the process) until it meets a saddle point at which 
the exit distribution becomes asymmetric. After that the choice of the two 
heteroclinic connections is not Markov anymore. The choice of the two 
heteroclinic connections may become Markov again if the system meets a 
saddle in which the symmetry is reestablished. We will illustrate how this 
phenomenon affects the exit distribution. 

Planar heteroclinic network with two nodes 

Let us give a concrete example. Consider the system in which b has two crit- 



ical points {Ci, C2} and phase space of the flow S is as depicted in Figure 1.4 
with D being a rectangle around the two critical points. Let X^{0) = xq be 
on the locally stable manifold Wf of Ci- Consider = n Bs{Ck), 
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Figure 1.4: Heteroclinic network example 



where Bs{(^k) is a ball of sufficiently small radius around and the 
locally unstable manifold of (k. There is an orbit connecting Ci with 



lim S q-i 

t — ^oo 



C2. 



Recall from Section [1. II that as e — )• the exit distribution concentrates near 
the minimizers oiV{x,-) over the boundary of dD. For a heteroclinic net- 
work this means that the exit concentrates at all points in the boundary 
that can be reached from x along a sequence of heteroclinic connections. In 
this case, these points are 



yi 



lim 5*^1+, 

t — s>oo 



2/2 



lim S'q+, 

t — !>00 



and = lim^^oo S ■ Then, the exit measure will weakly converge, as 
e — > 0, to 

Pl^yi+P25y2+P?,^yi.^ (1-24) 
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where pi,P2 and ps are positive numbers that sum up to 1. In the spirit of 
Theorem [9] a direct apphcation of our results imply a scaling limit to this 
convergence. 

Lemma 16 Consider the system just described. There is a family of ran- 
dom variables (^e)e>0; a family of random vectors ((^e)e>o and the random 
variable ao such that P{0^ £ {1, 2, 3}} = P{ao £ (0, 1]} = 1, and 

for every e > 0. The random vector {9e,(pe) converges in distribution to 
(^o,0o). 

The random vector {9q, (/)q) in principle can be obtained explicitly. It is clear 



from (1.24) that pi = P{9o = i},i = 1,2,3. 



Let {A+, — A_} be the set of eigenvalues of V6(Ci) and {//+, — ^-} the set 



of eigenvalues of V6(C2) (see Figure 1.4). Using the three observations made 



at the beginning of Section 1.2.2 several cases can be considered (see [6] for 
further explanation): 

• If /x+ < , and A+ < A_ then the system is symmetric withpi = 1/2, 
P2 = P3 = 1/4, and oq = 1. Here symmetric means that the random 
vector (pQ has no bias along the direction of Wf if Oq = 1 or along the 
direction W" if £ {2,3}. Asymmetric means that there is a bias in 
any of the aforementioned cases. 

• If /^+A+ < ^-A_, and A+ > A_, the system is symmetric if S {2, 3}, 
strongly asymmetric if A-|- < A_ and 6q = 1, and asymmetric if A+ = 
A_ and = 1- Moreover, pi = 1/2, p2 = 0,P3 = 1/2, when A_ < A+, 
and pi = 1/2, p2 = 0,P3 = 1/2, when A_ = A+. The random variable 
ao is given by 

"0 = T^<^{i}(^o) +'5{2,3}(6'o)- 

^+ 

• If > ^-A_, and A+ > A_, the system is strongly asymmetric 
and pi = 1/2, p2 = 0,p3 = 1/2, and 

"0 = ^^{i}{do) +/^-A-/(/i+A+)(^{2,3}(6'o)- 



• If ;U4. = and A+ = A_, the system is asymmetric and pi = 1/2, 
P2 G {0,P3),P3 < 1/2, and ao = 1. 
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• If > and A-i- = A_, the system is asymmetric 9q = 1 and 
strongly asymmetric otherwise. Moreover, pi = 1/2, p2 G (0,^3), < 
1/2, and 

"0 = S{iy{9o) + (/i_/^+)()'|2,3}(6lo). 

• If /i+ = and A+ > A_, the system is strongly asymmetric if ^0 = Ij 
and asymmetric otherwise. Moreover, pi = 1/2, p2 = 0,P3 = 1/2, and 
"0 = A_/A+. 

A formalization of this argument based on a weak convergence result is 
done in [3] . In such, the limiting behavior of the rescaled process 

=X,(tlog(e-i)) 

is obtained. Notice how this rescaled process instantaneously jumps along 
saddles. Hence if a weak convergence result has to be established, we need 
to introduce a new topology. Indeed, the standard Skorokhod topology does 
not allow to capture the curves along which the jumps are made. We state 
the weak convergence result in the next section. 

Weak convergence result 

In order to present the weak convergence result for the rescaled version of 
X^, we need to introduce a new topology. 

Consider all paths 7 : [0, 1] — )• [0, 00) x such that the first coordinate 
7*^ is nondecreasing. Equip the space of paths with the equivalence relation 
~, where 71 ~ 72 if and only if there is a path 7* and non-decreasing 
surjective functions Ai, A2 : [0, 1] — )• [0, 1] such that 7^ = 7* o A^. The set of 
curves X is the quotient of the space of paths with the equivalence relation 
~. Actually the set X can be regarded as a Polish space: 

Lemma 17 ( [4j) X can be made into a metric Polish space with distance 
function 

p{Ti,T2)= inf sup |7i(s) -72(s)|- 
7iGri,72er2 se[o,i] 

Refer to [1] for more information about this sapce. 

In order to state the result, we give a non-technical introduction to the 
notion of entrance-exit maps introduced in [2]. Let V be the set of proba- 
bility measures in M^, define out = (0, 00) x [0, 1] x x (0, 1] x V and 

Outk = {((t_,p_,x_,/3_,F_), (t+,p+,x+,/3+,F+)) e out^ : 

t_ = t+,x± = q^,P- +P+ = = 

Then we have the following definitions. 
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Definition 18 For each k, an entrance-exit map is a map 



where the domain of^k satisfies some regularity assumptions (see f4\ page 
10] ) . We denote = (^^ , )• 

Definition 19 Suppose xq G W|, for some 1 < k < N . The sequence 
z = (^0) -^ii, ■■■,Or-i, Zi^, ^r) is admissible for xq (refered as XQ-admissible) if 

1. Oq is the orbit of xq with S^xq Zi^, as t ^ oo; 

2. for each j £ {1, ..,r}, 6j is either the orbit of qf, or the orbit q~; 

3. for each j £ {l,..,r}, ij+i is either nf, or n~ according to whether 
thetaj is the orbit of qt or q^ . 

With each admissible sequence z we associate the sequence 

r7(z) = {{x'Q,aQ,^lQ),{ti,pi,Xi,ai,li.i),...,{tr,Pr,Xr,ar,llr)), 

where x'q = S^'^^^^xq, oq = 1, 

t'ixo) = M{t > : S'xo G BsiCh)}, 
and the rest of the entries are given by 



To each admissible sequence z we can associate a piecewise constant 
curve r(z) by identifying it with the path of curves such that spend time tj 
at the point xj and jump to the next point along the path 6j. Also we can 
associate probabilities through the relationship 



Note how the set of all admissible sequences for xq has the structure of 
a binary tree partially ordered by inclusion. We say that a set of admissible 
sequences L of xq is free if no two sequences of L are comparable with respect 
to this partial order. Additionally, if any sequence not in L is comparable 
to one sequence from L then L is called complete. It is clear that for any 
free set tt{L) := YlzeL '^('^) — 1' 'while for a complete set vr(L) = 1. 

The main theorem is: 




7r(z) =pi...pr. 
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Theorem 20 Suppose that ^^(O) = xq is in the heteroclinic invariant. For 
each e > 0, define the process Z^{t) = Xe{t\ log(e)|). Then, for any conser- 
vative set L of XQ-admissible sequences, there is a family of stopping times 
(Te)e>o such that the distribution of the graph T z^[t),t<T^ converges weakly 
in (X, p) to the measure M^q^l concentrated on the set 

{r(z) : z G L} 
and satisfying Mx^{o),l{^(.'^)} = ^(z)- 

Contributions made in the case S admits a heteroclinic network 

In this section we outline our contribution for the case in which S admits 
an heteroclinic network. 



The iteration procedure described in Section 1.4. 2| was first proposed 
in [Ij. It is the central idea in proving the main results in [Ij. This iteration 
is carried out in [4J by using an equivalent version of Theorem [oj This 
version was proved under the hypothesis that the non-linear system can be 
locally conjugated to a linear system by a transformation. That is, in ^ 



Theorem 20 is proved under the following hypothesis: 

Condition 21 At each critical point C, ^ Z there are non-resonant condi- 
tions. 



As discussed in Section |1.2| this is in general not the case, and examples of 
saddle point that do not satisfy this condition are known [48] . In this work, 



we completely remove condition 21 in the 2-dimensional situation 



On the other hand, observe that the iteration procedure is Section 1.4.2 
is based on the computation of a random map. This map is such that, for 
a domain V , to any given initial distribution of the diffusion it gives 
the exit distribution of from V . We call this map a random Poincare 
map. For 1/ C M^, let Ily be the set of probability measures with sup- 
port on V . Then, the random Poincare map for D, T/j : 11/) — t- Hg/j is 
the (deterministic) map such that T/jQ = PQ{Xe(TjP}, where Pq is the 
original probability measure conditioned on ^^(O) being distributed as Q. 



The iteration in Section |1.4.2 illustrates the use of this map. Notice that 
this methodology applies regardless the type of equilibria that the system 
exhibits. We base our proof of Theorems [9| and |13| on a similar idea. Hence, 
it is worth to study small noise perturbations with this direction in mind. 



As an example of a Poincare map, consider our example in Section 1.4.2 



The exit distribution is the composition of the Poincare maps T/jg o ... o 



^Dx^xQ^ where Di are ilustrated in Figures 1.5 and ??, and we are condi- 
tioning on exit along 1/3. 
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Figure 1.5: Illustration of the domains Di, D2, and used to compute the 
Poincare maps in the case of a heteroclinic network with 2 nodes conditioned 
on exit along 7/3: escape from the first saddle. 
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Figure 1.6: Illustration of the domains D3 and D4 used to compute the 
Poincare maps in the case of a heteroclinic network with 2 nodes condition 
on exit along y-y. escape from the second saddle. 
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1.5 General Setting 

The objective of this section is to estabhsh the general setting and notation, 
although each chapter has the necessary modifications and additions to the 
following. 

Let {Q, J^, P) be a complete probability space (every subset of every mea- 
surable null set is measurable) and Vl^ be a d-dimensional standard Brownian 
Motion on it. Let {J-f^)t>o be the filtration generated by W which satisfies 
the usual hypothesis [54, Section L5] . We assume that ($7, J^, P) is rich 
enough to accommodate a family of random vectors {^e)€>o in I^"^ such that 
the sigma algebra generated by is independent of for each e > 0. For 
each e > 0, we consider the left continuous filtration 

as well as the collection of null sets 

No = {Z cn:3Geg'^ with Z C G and P{G} = 0}. 

Let us create the augmented filtration = (j{Ql U Nq) for t G [0, oo), and 
= a{Dt>oJ^i)- It can be shown that W is a brownian motion with 
respect to {Fi)t>o-, the path of W is independent of and {Fl)t>Q satisfies 
the usual hypothesis, for every e > 0. 

Throughout the text, we suppose that the family of random variables 
(Ce)e>o satisfies — )• in distribution, and that .^^ has a finite second 
moment for each e > 0. 

Consider a C°°-smooth vector field h on M"' and a C'^-smooth matrix 
valued function o" : M*^ — )• M'^^"'. Consider the Ito stochastic differential 
equation 

dX, = b{X,)dt + ea{X,)dW (1.25) 
equipped with initial condition 

X,(0)=xo + e%, (1.26) 

where a G (0, 1]. Hypothesis regarding the point xq G M*^ will be given in 
each chapter. We assume that both b and a are uniformly Lipschitz and 
bounded, i.e., there is a constant L > such that 

\a{x)-a{y)\V\b{x)-b{y)\ < L\x - y\, x,y£R^, 
|cj(x)| V |6(x)| < L, xGM^ 
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where | • | denotes the EucHdean norm for vectors and Hilbert-Schmidt norm 
for matrices. Further assume that the matrix function a = era* is uniformly 
positive definite. These conditions can be weakened, but we prefer this 
setting to avoid multiple localization procedures throughout the text. These 
assumptions imply [411 Theorems 5.2.5 and 5.2.9] that equation (1.25) has 
a strong solution with strong uniqueness on the filtered probability space 
(f], J^, P, {J^t)t>o) with initial condition (1.26) for each e > 0. Let us recall 
the definition of strong uniqueness and strong solution for completeness. 



Definition 22 A strong solution of the stochastic differential equation (1.25) 
with initial condition (1.26) on the filtered probability space (0, J^, P, {J~t)t>o) 
is a process = {Xf:{t); <t < oo} with continuous sample paths and with 
the following properties: 

1. Xf: is adapted to the filtration {J-^)t>o, 

2. P{X,(0) = xo + e°Cj = 1, 

3. V{j'^{\h{X,{s))\ + a^^j{X,{s)f)ds < oo} = 1 for every 1 < i,j < d 
and t > 0, 



4- the integral version of (1.25) 



X,{t)=Xe{0)+ [ h{X,{s))ds + e I a{X,{s))dW {s)]Q <t < oo, 
Jo Jo 



holds with probability 1. 



Given two strong solutions X^ and X^ of (1.25) with initial condition (1.26) 
relative to the same brownian motion W . Then, we say that strong unique- 
ness holds whenever P{Xe(t) = X^(t);0 < t < oo} = 1. 

For a general background on stochastic differential equations see, for exam- 
ple, [m Chapter 5]. 

The flow generated by b is denoted hy S = ('S'*x)(j ^.-jgjgxK'*- That is, S^x 
satisfies 

^S^x = b(S^x), S^x = x. 
dt 

The linearization of S along the orbit of x is denoted by ^x{t)- 
d 



dt 



^x{t) = A{t)^S), ^x(O) =/, 



:i.27) 
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where A{t) = 'Vb(S^x) and / is the identity matrix. Here V is the derivative 
operator, that is, for a differentiable vector field /i : M"^ — t- M*^, Vh is the 
]^dxd jj^atrix derivative of h. 

Throughout the text D is a domain (open, connected and bounded) in 

with piecewise boundary. 

The exit problem for the diffusion process in D is studied. We are 
interested in the joint asymptotic properties (as e — 0) of {X^{tP),tP), 
where r/^ is the stopping time defined by 

rf = rf (xo) = mf{t > : X,{t) G dD}. 



Specific hypotheses on the vector field will be given in each chapter. On 
the other hand, the abstract formulation will not be given in each chapter, 
instead we assume this technical formulation to hold throughout the text. 



1.5.1 Organization of the Text 

The organization of the rest of the text closely mimics the presentation given 
in this chapter. 

In Chapter [2] the planar (i.e. d = 2) exit problem is studied under the 
assumption that S has a unique saddle at the origin. That is, S M'^ is the 
only critical point 5(0) = and the eigenvalues A+, — A_ of the of the matrix 
V6(0) are such that X± > 0. The exit problem is studied conditioned that 
the process X^ starts on the stable manifold of 0. 

In Chapter |3] the Levinson case in arbitrary dimensions is considered. 



We also proved Lemma 12 stated in this chapter, and use it intesively in the 



proof of the Levinson case. In this section, we also study the 1-dimensional 



example discussed in Section 1.4.1 of this chapter. 

In Chapter [4] we present a short survey on how the techniques in this 
text can be extended. Several open problems are also discussed. 



Chapter 2 

Saddle Point 



In this chapter we study the stochastic process when the underlying 
deterministic system 5 has a unique saddle point. 

In Section 2.1 we introduce the setting, which relies on the setting pre- 
sented in Section |1.5| of Chapter [l} In Section 2.2 we state the main theorem 
and split the proof into several parts. In Section 2.3 we introduce a simpli- 
fying change of coordinates in a small neigborhood of the saddle point. The 
analysis of the transformed process in Section 2.4 is based upon two results. 



Their proofs are given in Sections 2.5 and 2.6 



2.1 Setting 



For this chapter we consider the general formulation made in Section 1.5 of 



Chapter [T| except that we restrict ourselves to the 2— dimensional situation. 



The process is the strong solution of (1.25), under the assumptions made 
on the C°°-smooth vector field 6, the C^-smooth matrix valued function a : 
M? — )• M^^-^ and the standard 2-dimensional Wiener process W in Section 1.5 
of Chapter [Tj 

We will study the exit problem from the domain D C'M? with piecewise 
boundary. Assume that the origin belongs to D and it is a unique 

fixed point for 5* in D, or, equivalently, a unique critical point for b in D. 

Therefore, 

b{x) = Ax + Q{x), 

where A = V6(0) and Q is the non-linear part of the vector field satisfying 
\Qix)\=O{\x\^),x^0. 

Suppose that is a hyperbolic critical point, i.e. the matrix A has two 
eigenvalues A+ and — A_ satisfying — A_ < < A+. Without loss of gen- 
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erality, we suppose that the canonical vectors are the eigenvectors for the 
matrix, so that A = diag(A+, — A-). 

For an interval J C M, let S'^x denote the set 

S-^x = {S^x :teJ}. 

According to the Hadamard-Perron Theorem (see e.g. \53\ Section 2.7]), the 
curves W'^ and defined via 

W" = {x G Z) : lim S^x = 0, and for some s > 0, S'-'^^'^^x C D and S^''°°^x nD = 

i— oo 

and, 

W = {xGD: lim S^x = 0, and for some s < 0, S^-°^''^xnD = and S'^^'^^^x C D} 

are smooth, invariant under S and tangent to 62 and, respectively, to ei at 
0. The curve is called the stable manifold of 0, and is called the 
unstable manifold of 0. 

We assume that intersects dD transversally at points g+ and q- 
such that the segment of W" connecting g_ and g+ lies entirely inside D 
and contains 0. 



We fix a point xq G W H D and equip (1.25) with the initial condition 



X,(0) = xo + e%, e>0, (2.1) 

where a G (0, 1] is fixed, and {^e)t>o is a family of random vectors inde- 
pendent of W, such that for some random vector ^Qj ~^ '^0 as e — t- in 
distribution. 

If a 7^ 1, then we impose a further technical condition 

P{eo II b{xo)} = 0, (2.2) 
where || denotes collinearity of two vectors. 

2.2 Main Result. 

The main result of the present chapter is the following: 

Theorem 23 In the setting described above, there is a family of random 
vectors (^e)e>o, a family of random variables {ipe)e>o, and a number 

f 1, aA_ > A+ ^ ^ 

13=1 xl , ; X 2.3 
1 a^, aX- < A+ 
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such that 

The random vector 



converges in distribution as e — )• 0. 

The distribution of ipejff'e, and the distributional hmit of ©e wih be de- 
scribed precisely. 

The proof of Theorem [23] has essentially three parts involving the analysis 
of diffusion (i) along W^; (ii) in a small neighborhood of the origin; (iii) 
along W". 

In order to study the first part, we need to introduce ^xit) as the lin- 
earization of S along the orbit of x G M^, i.e. we define ^x{t) to be the 
solution to the matrix ODE 

j^^xit) = A{t)^x{t), $^,(0) = /, 

where A{t) = V6(S*3;). We have the following theorem: 

Theorem 24 Let x G 'M? and {Ce)e>o be a family of random vectors in- 
dependent of W and convergent in distribution, as e — )• 0, to (^q. Suppose 
a £ (0, 1] and let be the solution of the SDE (1.25) with initial condition 
^^(O) = x + e"^e- Then, for every T > 0, the following representation holds 
true: 

X,(T) = 5^x + e%, e>0, 

where 

■z Law r „ 

— >^o, e 0, 

with 

Co = $x(T)eo + l{a=l}N, 
N being a Gaussian vector: 

iV = $,(r) r <^>xis)-'aiS'x)dWis). 
Jo 



If a = 1 or assumption (2.2) holds, then P{^q \\ b{S x)} = 0. 
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The second part of the analysis is the core of the chapter. Theorem 25 
below describes the behavior of the process in a small neighborhood U of 
the origin. Notice that since xq G W^, one can choose T large enough to 
ensure that that S^xq S H U. Therefore, the conditions of the following 



result are met if we use the terminal distribution of Theorem 24 (applied to 



the initial data given by (2.1)) as the initial distribution. 



Theorem 25 There are two neighborhoods of the origin U d U' <Z D , two 
positive numbers S < 6', and dijjeomorphism f : U' ^ {—6', 6')'^, such 
that f{U) = {—6, 6)"^ and the following property holds: 

Suppose X G n U, and (^e)e>o is a family of random variables in- 
dependent of W and convergent in distribution, as e — 0, to where 



satisfies (2.2) with respect to x. Assume that a G (0,1] and that 



solves (1.25) with initial condition 



X,(0) = x + e"e., 



(2.4) 



where satisfies condition (2.2) with respect to x. 



There is also a family of random vectors (0^)e>o, and a family of random 
variables ('0^)e>o, such that 



X,(rf) = r7(sgn(V:)5ei)+e 



where g = f , f3 is defined in (2.3), and the random vector 



a 



converges in distribution as e — )• 0. 

The notation for Q[ and its components is chosen to match the notation 
involved in the statement of Theorem 23 Random elements ip'^yfp'e ^-^d the 



distributional limit of 0'^ will be described precisely, see (2.30). Obviously, 



the symmetry or asymmetry in the limiting distribution of ip'^ results in 
the symmetric or asymmetric choice of exit direction so that the exits in the 
positive and negative directions are equiprobable or not. On the other hand, 
the limiting distribution of (p[ determining the asymptotics of the exit point 
can also be symmetric or asymmetric which results in the corresponding 
features of the random choice of the exit direction at the next saddle point 
visited by the diffusion. 



In Section 2.4 we prove Theorem 25 using the approach based on normal 
forms. 
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The last part of the analysis is devoted to the exit from D along W". 
We need the following statement which is a specific case of the main result 
of Chapter [3j 



Theorem 26 In the setting of Theorem 24-, assume additionally that (i) 
q = S'^x € dD; (ii) there is no t ^ [0,r) with S^x € dD; (Hi) b{q) is 
tranversal (i.e. not tangent) to dD at q. Then 



tF 4 T, 



(2.5) 



and 



Law 



0, 



(2.6) 



where vr denotes the projection along b{q) onto the tangent line to dD at q. 



Now Theorem 23 follows from the consecutive application of Theorems 24 



through 26 and with the help of the strong Markov property. In fact, in this 



chain of theorems, the conclusion of Theorem 24 ensures that the conditions 



of Theorem 25 hold, and the conclusion of the latter ensures that the con- 
ditions of Theorem [26] hold. Notice that the total time needed to exit D 
equals the sum of times described in the three theorems. Notice also that 
at each step we can compute the limiting initial and terminal distributions 
explicitly. Theorems [24] and [26] contain the respective formulas in their for- 
mulations, and the explicit limiting distribution for 0^ of Theorem 25 is 



computed in (2.30) 



2.3 Simplifying change of coordinates 

2.3.1 Smooth Transformation and Normal Forms 

In this section we give a brief review of the theory of Normal Forms. In par- 
ticular, we focus on the neighborhood of a saddle point for the deterministic 
flow S. 

The idea is to find a local change of variables ^ : — )• such that 
z{t) = 9{S^x) satisfies i = Az with the appropriate initial condition. First, 
note that z satisfies the equation 



—z{t) = V9{S^x)b{S*x) 

= Ve{z{t))-\9-\zm, z{0) = 9{x). 
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Hence, the goal is to find a transformation : — )• that leaves V0{z)^^b{9' 
as simple as possible (ideally equal to Az). 

We start with some notions. For a multi-index a = (01,02) G ^+ and 
a base {61,62} of M^(as a vector space over M) we denote the monomial 



•ju — 1 2 I' 

Definition 27 For a non-negative integer r, the space of linear combina- 
tions (overM.) of monomials x°'ei with \a\ = ai -\- 02 = r , is called the space 
of Homogenous Polynomials in 2 variables of degree r. This space is denoted 
as Tir- In other words, Hr is, 

Tir = span]g ■ G Zi^, |a| = r and 1 < j < 2} . 

It is easy to see that Tir is isomorphic (as a vector space over the real 
numbers) to M2(''"+i). 

Using this notation, use Taylor's classical theorem to decompose the 
function 6 : ^ as 

biz) = Az + b2{z) + ... + bR{z), (2.7) 

with bi £ Hi ioi 1 < i < R, and 6/j(x) = 0{\x\^) as |x| 0. 

Suppose that z = Ok{C), where 9^ is the near identity transformation 

ek{C) = C + hk{C), hk£Hk, k>2. (2.8) 

Note that 9k is a topological diffeomorphism in a small open neighborhood 
of the origin 0^. Throughout we restrict the analysis inside Qk- A Taylor 
approximation shows that the inverse of 9k satisfies 

Ok\C) = C-hk{0 + o{\C\''-') 

= C-hk{C) + 0{\C\''+'). (2.9) 

Further application of Taylor's approximation together with the condition 
that k >2, imply that for any G O^, 

V9k{cr' = i-vhk{o+o{\c\^^''~'^) 

= /-v/ifc(c) + o(|c|'=). 



Also, from (2.9), we obtain that for any « = 1, ...,R — 1, 

'(C)) = 6.(0 + 0(1(1'=+'). 
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Using ( |2.7| ) and this bounds, we get that 

{ve,{c))-'b{0^HO) = + 62(C) + - + &fc-i(C) 



+ (6,(C)-£i/ifc(C)) + 0(|Cl'+'), 
where we defined the operator C\ : Tik T-Lk by 

C\h{Q) = h{C)AC-AVh{Q). (2.10) 
It is clear that the following theorem holds: 

Theorem 28 Let Tl{C\) C Tik be the range of the operator L\ : Ji^ — ^ 'Hfc. 
Take C he any subspace such that T-Lk = T^{^a) ®^k- Then, there 
is a sequence of near identity transformations of the form (2.8) and nested 
neighborhoods of the origin i^k+i C 0,^, such that z(t) = Oj. o ■ ■ ■ o 92{S^x) 
satisfies 

j^z{t) = Az{t) + b2{z{t)) + ... + br{z{t)) + 0{\zr+'), 

inside Qr, and b^ G Ik, k = 1, r. 

An equation written in this form is said to be in Normal Form up to 
order r. 

The idea is to characterize the image of the operator £^ and simplify 
each non-linear part of b, starting from 62 and all the way up to In order 
to achieve this, we remark that, x'^ej is an eigenvector of C'^ for any a E Z^: 

£^x"ej = (A^a- Aj)x"ej, 

for A = (A+,A_). This motivates the following definition: 

Definition 29 A pair of complex numbers A = (Ai, A2) is said to be non- 
resonant if there are no integral relations between them of the form Xj = a-X, 
where a = (ai, 02) € is a multi-index with \a\ = ai + a2 > 2. Otherwise, 
we say that X = (Ai,A2) is resonant. 

A resonant X is said to be one-resonant if all the resonance relations for 
X follow from a single resonance relation. 

A monomial called a resonant monomial of order R 

if a ■ X = Xj and \a\ = R. 



In the spirit of Theorem 28 it is clear (see , \m\ ) that for any pair of 
integers R> 1 and A; > 1, there are two neighborhoods of the origin 0,f and 
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and a C'^-diffeomorphism f : Q,f ^ Qg with inverse g : rip — )• such 
that 

{Vg{y))-'b{g{y)) = Ay + P{y)+TZ{y), y G (2.11) 

where P is a polynomial containing only resonant monomials of order at 
most R and Tl{C) = 0(|C|'^^^)- Moreover, the so called Poincare theo- 
rem |20| Theorem 2.2.4] asserts that if A is non-resonant, then / can be 
chosen so that both P and TZ in ( 2.11| ) are identically zero. If A is one- 



resonant then [38, Theorem 3, Section 2] says that / can be chosen so that 



TZ in (2.11) is identically zero. More precisely: 



Lemma 30 For any k > 1, there are two neighborhoods of the origin 0/ 

and and a -diffeomorphism f : Q.f ^ Q.g with inverse g : Qg 0,f 
such that 

{Vg{y))-H{g{y)) = Ay + P{y), y^Qg, (2.12) 

where P is a polynomial that contains only resonant monomials. 

This is the core result we use to study the stochastic case in the next 
section. 



2.3.2 Change of Variables in the Stochastic Case 

In this section we start analyzing the diffusion in the neighborhood of the 
saddle point. The first step is to find a smooth coordinate change that would 
simplify the system. This can be done with the help of the theory of normal 
forms presented on the last section. 

Let 5 be a C°°— diffeomorphism of a neighborhood of the origin with 
inverse /. When is close to the origin and belongs to the image of that 
neighborhood under g, we can use Ito's formula to see that = /(^e) 
satisfies 

dY, = Vf{X,)dX, + ^[V/(X,),X,] 

= Vf{g{Y,mg{Y,))dt + ed{Y,)dW + e^^{Y,)dt, 

for some smooth function : — >• and a = ((V/) o g) a. Here the 
square brackets mean quadratic covariation. Since V/o g = (V^)"^, we can 
rewrite the above SDE as in the deterministic case as 

dY, = (^{Vg{Y,))-^ h{g{Y,)) + e^iY,)) dt + ea{Y,)dW. (2.13) 

In order to simplify the drift term in this equation, we rely on Lemma [30l 
First, note that (A4.,— A_) is either non-resonant or one-resonant (resonant 
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cases that are not one-resonant are possible in higher dimensions where pairs 
of eigenvalues get replaced by vectors of eigenvalues). The non-resonant case 
(in any dimension) was studied in 0]. In this paper, we extend the analysis 
of [3] to the non-resonant case, i.e. the one-resonant case, given that we are 
working in 2 dimensions. 

To find all resonant monomials of a given order r > 2, we have to find 
all the integer solutions to the two 2x2 systems of equations: 

QiA+ — a2X~ = iA-t, 
ai + "2 = r. 

Therefore, the power multi-indices of a resonant monomial of order r has to 
coincide with one of the following: 

{at{r),at{r)) = ^-1^(A+ + rA_, (r - 1)A+), (2.14) 

(ar(r),a2(r)) = ^--l^((r - 1)A_, rA+ + A_), (2.15) 

Let us make some elementary observations on integer solutions of these 
equations for r > 2. 

1. None of the solution indices can be 0. Moreover, neither af{r) nor 
a2{r) can be equal to 1. 

2. As functions of r, af{r) are increasing. 



3. Expressions (2.14) and (2.15) cannot be an integer for r = 2. 



4. The term P = (Pi, P2) in < \2.12\) satisfies Pi(y) = 0{yf\y2\) and P2(y) = 
0(|yi|y|). This observation is a consequence of observations 1 and 3 
since they imply that resonant multi- indices have to satisfy a^{r) > 
(2,1) and a'(r) > (1,2) coordinatewise. 

5. If at least one of the coordinates yi and y2 is zero, then P(yi, 2/2) = 0. 
This is a direct consequence of the previous observation. 

Given all these considerations, the main theorem of this section is a simple 



consequence of Lemma 30 



Theorem 31 In the setting described in Section \3.1\ there is a number 
6' > 0, a neighborhood of the origin U' , and a C"^ -diffeomorphism f : U' ^ 
{—6', 5') with inverse g : {—6', 5')'^ — )■ U' such that the following property 
holds. 
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If X^{0) £ U , then the stochastic process = (Y^^i,Y^^2) given by 

y,(i) = /(X,(tArf)) 

satisfies the following system of SDEs up to : 

= iX+Ye,i + Hi{Y„ e)) dt + eaiiY,)dW (2.16) 
dy,,2 = (-A-y.,2 + H2iY„ e)) dt + ea2iY,)dW, (2.17) 

where a-i : {—6', 5')'^ — t- M are functions for i = 1,2. T/ie functions Hi 
are given by Hi = Hi + e^^j, where "^i : {—6', 6')'^ — )• are continuous 
bounded functions, and Hi : {—6', 5')'^ x [0,oo) are polynomials, so that for 
some constant Ki > and for any y G {—6', 6')'^, 

|i?i(y)| <i^l|ylr^|2/2r^ 

\H2{y)\ < Ki\yi\"^ \y2r^ . 
Here, the integer numbers af , i = 1,2, are such that {af , a^) is of the form 



(2.14) for some choice of r = ri > 3, and and (a-j^ , 02 ) is of the form (2.15) 



for some choice r = r2 > 3. In particular, 

\Hi{y,e)\ <Kiyl\y2\+K2e^, 
\H2{y.e)\ <Ki\yi\yl + K2e^, 

for some constants Ki > and K2 > 0. 



2.4 Proof of Theorem 25 



In this section we derive Theorem 25 from several auxiUary statements. 



Their proofs are postponed to later sections. 

Theorem |31| allows to work with process Y^ = f{X^) instead of while 
stays in {-5',5'Y 

If we take 5 G (0, 5'), then for the initial conditions considered in Theo- 
rem 



25 and given in ( |2.4[ ), 

P{X,(0) G [/'} ^ 1, e ^ 0, 

i.e., 

P{ye(0) G (-y,y)2}^ 1, e^O. 
Moreover, denoting f{x) hy y = (0, ^2) we can write 

y,(0) = y + e"xe = (e"x.,i> ^2 + e"x.,2), e > 0, 
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where Xe = (Xe,iiXe,2) is a random vector convergent in distribution to 
Xo = (xo, 15X0,2) = V/(x)^o- Due to the hypothesis in Theorem 25, we 
notice that the distribution of xo,i has no atom at 0. 
Let us take any p £ (0,1) such that 

A , ^ 

and define the following stopping time: 

f, = inf{t:|y,,i(t)|=e"P}. 

Up to time f^, the process mostly evolves along the stable mani- 
fold W*. After fe, it evolves mostly along the unstable manifold W". Pro- 
cess evolves accordingly, along the images of and W" coinciding with 
the coordinate axes. 

Let us introduce random variables rjf^ via 

rit = e-°e-^+^^y,,i(f,), 
r?7 = e-(i-f)^-/^+y,,2(r,). 

Also we define the distribution of random vector {r]Q , tjq ) via 

ryo+ = xo,i + l{a=i}N+, (2.19) 
Vo = I r/J 1^-/^+2/2, 

where 

poo 

N+ = e-^-'di{0,e-^''y2)dW (2.20) 
^0 

is independent of xo 1 • 



Lemma 32 If the first inequality in (2.18) holds, then 

P{ye,i(re) = e"Psgn?7+} ^1, e ^ 0. (2.21) 

and 

'nt,v7:T'e + j^{'^-p)^oge^ ^ |^r/+,r/o",--^log|r?+|^ , e ^ 0. 

(2.22) 
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We prove this lemma in Section 2.5, Along with the strong Markov 
property, it allows to reduce the study of the evolution of after fe to 

(2.23) 



studying the solution of system (2.16)-(2.17) with initial condition 
where 



(e"P sgn r/+ , e^^^'P)^- ry^ ) , 



Law 



0. 



We denote 



r, = r,(5)=inf{t>0:|y,,i(t)|=5}. 



(2.24) 



(2.25) 



Our next goal is to describe the behavior of Y{Te). To that end, we introduce 
a random variable 9 via 



Law 



5 



2/2 + N, qA_ = A+, 
y2, aX- < A+. 



(2.26) 



where the distribution of N conditioned on r]^, on {sgnryfj" = ±1} is centered 
Gaussian with variance 



„2A_ 



a2(±<5e^+^0) 



ds. 



Let us also recall that (3 is defined in (2.3). 



Lemma 33 Consider the solution to system (2.16)-(2.17) equipped with ini- 
tial conditions (2.23) satisfying (2.24). If the second inequality in (2.18) 
holds, then 

P{|y,,i(re)| = 5} ^ 1, e^O, (2.27) 



r, + — log e — > — log d, 
A+ A+ 

Moreover, if (3 < 1, then the convergence in probability also holds. 



(2.28) 
(2.29) 



A proof of this lemma is given in Section 2.6 



Now Theorem 25 follows from Lemmas 32 and 33 In fact, the strong 
Markov property and ( |2.21 ) imply 

P{rf = f, + T,(5)}^l, e^O, 
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so that the asymptotics for is defined by that of and t^{6). It is also 
clear that one can set ip'^ = rjf, and (j)'^ = V g{sg'ii{ri^)5ei)Ye{Te) , so that the 
limiting distribution of 0^ is given by 



V5((sgn(??([)(5ei)(6le2), log -^r 



(2.30) 



where random variables t/q and 9 are defined in (2.19) and (2.26) 



2.5 Proof of Lemma [32 



In this section we shall prove Lemma 32 using several auxiliary lemmas. We 
start with some terminology. 



Definition 34 Given a family (^e)e>o of random variables or random vec- 
tors and a function h : (0, oo) — )• (0, oo) we say that = Op(/i(e)) if for 
some eo > distributions of ('?e/^(e))o<e<eoi form a tight family, i.e., for any 
5 > there is a constant Ks > such that 

P m > Ksh{e)} < 6, 0<e<eo. 

Definition 35 A family of random variables or random vectors (Ce)e>o is 
called slowly growing as e — )• (or just slowly growing) if = Op{e~^) for 
ah r > 0. 



Our first lemma estimates the martingale component of the solution of 



SDEs ( |2l6l ) and ( |2l7| ). Let us define 

ft 



StiT) 

s;{T) 



sup 

t<T 



sup 

t<T 



diiY,{s))dW{s] 







, T>0, 



-^-it-^)a2{Y,is))dW{s) 



T > 0. 



Lemma 36 Suppose {Te)e>o is a family of stopping times (w.r.t. the natural 
filtration of W ) . Then 

St{n) = Op{l). 

If additionally {Te)e>o is slowly growing, then S~{Te) is also slowly growing. 
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Proof. Let us start with the proof for S^. Use BDG inequahty (see [H 
Theorem 3.3.28]) and Ito's isometry to see that for every constant K > 0, 



1 



I'{Stin)>K}<j^BSt{T, 



Since Ye{t) = f{Xe{t A t^)), the process t i— ai{Ye{t)) is almost surely 
bounded. Hence, integrability of the exponential t i— t- e~^^+* implies that 
for any S > 0, there is a Ks > such that 

snpP {St in) > Ks} <S, 

proving the first part of the lemma. 

For the second part, fix 5 > and r > 0. For every < p < 2r, there is 
Kp > and eo > such that 

sup P{ePT, > Kp} < 6/2. 

0<e<eo 

Then, for an arbitrary A' > 0, < e < eo and < p < 2r, it holds that 
P {e'S-{T,) >K}<P{t,> e~PKp] + P {e' S^ {t,) >K,t,< e-f'Kp} 

ft 



<6/2+ V P<^e^ sup / 

[ {k-l)<t<k Jo 



,-\-{t-s)~ 



a2{Y,{s))dW{s) 



> K 



In order to bound each probability in the last sum, proceed as for the other 
case: 

ft 



P I sup 

y {k~-l)<t<k Jo 

< P |e'^e~(''~^)^- 

g2rg-2(fe-l)A_ 



-\-{t-s) 



a2{Y,{s))dW{s) 



> K 



sup 

0<t<k 



e^-'a2{Y,{s))dW{s] 



> K 



< 



-E 



a2(Y,is))\'ds 



< 



for some constant C2 > 0. Hence, there is a constant C3 > such that 

P {e'-S-in) >K}< 6/2 + ^e^'-" 
which implies the result and finishes the proof. 
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Lemma 37 Suppose is the solution of equations (2.16)-(2.lD with ini- 
tial conditions given by 



y,,i(0) = e°x.,i and ^.,2(0) = 2/2 + e"Xe,2, 



(2.31) 



where distributions of random variables (Xe,i)e>o o-nd {Xe,2)e>o form tight 
families. Let us fix any R > and denote l^ = A (— ^ loge + R) f 
e > 0. Then 



or 



and the family 



supe-^*|ye,iWI =Op(e"), 



e-"sup|y,,2(t)-e-^*(y2 + e"x.,2) 
t<i. 



e>0 



is slowly growing. 



Proof. The tightness property impUes that without loss of generahty we 
can assume that |Xe,2| < C for some constant C > and every e > 0. 



that 
and 



Let us fix 7 > 0. We can use Lemma 36 to take c = 0(7/3) > such 

P{5+(/,) >c}< 7/2, 



P{57(/,)>ce-n<7A 



where q is an arbitrary number satisfying < q < a. Let us introduce a 
constant K = (3c) V C and stopping times 

/3+ = inf |t > : e-^+*|ye,i(i)l > 2i^e°} , 
/3_ = inf {t > : |ye,2(t) - e-^-\y2 + e"xe,2)| > 2ire"-«} , 
/3 = /3+ A/3_ A/,. 



We start with an estimate for l^.i. Duhamel's principle for (|2.16|), Theo- 
rem 



31 and Lemma 36 imply that the estimate 



supe-^+*|y,,i(t)| <e"/^ + iri / e~^+'Y,,i{sf\Y,,2{s)\ds + K2^ + eSt{(3) 

t<l3 Jo ^+ 

< e'^K + Ki r e-^+^n,i(s)2|y,,2(s)|ds + + 

Jo -J 



(2.32) 



CHAPTER 2. SADDLE POINT 



51 



holds with probabiUty at least 1 — 7/2. We analyze each term in the RHS 
of equation (2.32). 

Let us start with the integral in ( 2.32| ). For s < /3, we see that 



Therefore 

r/3 







< i<:e°/12 + 5KiK2e2«|y2| / e'^^+'^'^ds 

Jo 

(2.33) 



for ah e > small enough. Notice that this is a rough estimate, the constants 
on the r.h.s. are not optimal but sufficient for our purposes. This also applies 
to some other estimates in this proof. 

Let us estimate the integral on the r.h.s. of (2.33). When A+ > A_, the 
integral is bounded by 



1 



A4- - A- 



,(A+-A_)/3 < 



g(A+-A_)R 

A+ - A_ ' 



-Q! + Q:A_/A+ . 



if A+ < A_, then the integral on the r.h.s of (2.33 ) is bounded by (A_ — A+)~^; 
if A+ = A_, then the integral is bounded by 2aA^^| loge|. Hence, for some 
constant Kx^ X- > and e > small enough, 

Ki f e-^+'Y,^i{sf%^2{s)\ds < Ke"/12 + i^A+,A_e'"""^'"^-/^+^^|loge| 
Jo 

< Ke"/6. (2.34) 

Also, for e > small enough, 

K2e^/X+ + eK/3 < Ke"/2. (2.35) 

From ( |2.32D , ( |2.34D and ( |2.35| ) we get that for all e > small enough, the 
event 



A= < supe-^+*|y,i(i)| < 5i^e°/3 
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is such that P{A) > 1 - 7/2. 

Let us now consider l'e,2(t) and denote 

Z,(t)=y,,2(t)-e"^-*(y2 + e"x.,2 



Duhamel's principle for Y^^2, the definition of /3, Theorem 31 and Lemma 36 
imply that the inequalities 

sup|Z,(t)| < Kisup f e-^-^'-'^\Y,,i{sT^\Y,,2{sT^ds + K2eyX- + eS~{^) 

t<l3 t<l3 Jo 

< Kisup f e-^-^'-'^\Y,,i{sT^\Y,,2{sT^ds 

t<l3 Jo 

< 2"i"e""^/^"^Kisupe-^-* / e^^-+"^^+^'\Y,^2{sT^ ds + e''-'iK/2 

t<f3 Jo 

(2.36) 

hold with probability at least 1 — 7/2 and for all e > small enough. We 
analyze the integral term in (2.36). Note that, from the definition of /3, and 
the inequality (o + by < 2^-i(a^ + b'') we have that for any t < /3 and any 
e > small enough, 

\Ye,2(.tT^ < 2°2"-iZ,(t)"2" +2"2--ie-"2^-*|y2 + e"x.,2r2" 

< ga2'{a-9)22(«2 -1) ^2_ft:"2 + e^"^ |xe2r2^) + 2^("2 ~^)e~"2 1^2 . 
Hence there is a constant > such that 

\Ye,2{tT^ < ("-"^-fCa + K«e-"2"^-*, t < p. 

Using the last inequality, the definition of /3, and the fact q^A+ — — 
1)A_ = from Theorem 31, we get 



Jo 



A_ + A+ Jo 



A_ + A+ 



(2.37) 
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Again, from Theorem 31 we know that > 1 and 02 — 2 which together 



with (2.37) imply that for aU e > smah enough 



2°i e"°i Kisupe"^-* / e'^^-+''^ ^+'>'\Y,^2{sT^ ds < Ke"~y6. (2.38) 



Using (2.38) and (2.36) we conclude that the event 



is such that P(-B) > 1 — 7/2, for all e > small enough. 

The proof will be complete once we show that f3 = with probability at 
least 1 — 7. The latter is a consequence of the following chain of inequalities 
that hold for all e > small enough: 

P{/3+ A /3_ < Z J < P ({/3+ A /?_ < /e} n A n + PiA") + PiB") 
<P({/3+A/3_ <k}nAnB) + ^ 

< P ({/3+ < /3„ A /J n A) + P ({/?_ <f3+AQnB) + j 
= P{2 < 5/3} + P{2 < 2/3} + 7 = 7. 



Let us now analyze the evolution of the process up to time A . 
We start with an application of Duhamel's principle: 



Ye,2{t) = e-^-%2{0) + [ e~^-'^'-''>H2{Y,{s),e)ds + eN^it), 

Jo 

where N^{t) are defined by 



Lemma 38 



Kit) = / e-^+'ai{Y{s))dW{s), 
Jo 

N-it) = [ e'^-^'-'^a2iY,{s))dW{s). 
Jo 



sup|y,,2(t)-e-^-*y2| = Op(e"^) 

t<Tt 



(2.39) 
(2.40) 



(2.41) 
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Proof. Duhamel's principle, Theorem 31 , and the definition of imply 
that for some K > 0, 

\Ye,2(.t) - e~^-%\ < e"|xe,2| + fe-^-^'-'^ {Ki\Y,,i{s)\Y,\{s) + K^e^) ds + eS^ {t) 

Jo 

< e"|xe,2| + Ke'^P + {e^-^'P S- (n)) 



for any t G (0, fe). The result follows since by Lemma 36 the r.h.s. is Op(e"^) 



As a simple corollary of this lemma, the first statement in Theorem 32 
follows: 



Corollary 39 e 0, 



P{rf < fj ^ 0. 



In particular, (2.21) holds true. 



Lemma 40 Let 



Then 



N+{t)= f e-^-'ai{0,e-^-'y2)dW. 
Jo 



sup|iV+(t)-iV+(t)| 



0, e^O. 

Proof. BDG inequality implies that for some constants Ci,C2 > 0, 



Esnp\N^{t)-N+{t)\^ < CiE / e--'^+'\ai{Y,,i{s),Y,,2{s)) - {0, e~^-'y2)\''ds 

t<T^ 



< CaE sup I ai (y,,i (s) , y,,2 (s) ) - (0, e 

t<T^ 



From Lemma 38 and the definition of r^, it follows that 



sup 

t<T, 



(y,,i(t),y,,2(t)) - (0,e-^-*y2) = Opie^n- 



(2.42) 



(2.43) 



The desired convergence follows now from (2.42), (2.43), and the bounded- 
ness and Lipschitzness of ai . ■ 

We are now in position to give the first rough asymptotics for the time . 
From now on we restrict ourselves to the event {t^ > '^e} since due to 
Corollary |39| its probability is arbitrarily high. 



CHAPTER 2. SADDLE POINT 



55 



Lemma 41 As e ^ 0, 



a 



P <! f, > -— loge 



0. 



Proof. Let be the solution to the fohowing SDE: 

due{t) = X+Ue{t)dt + €ai(Ye{t))dW{t), 

Let us take 6o £ (0, 1) to be specified later and consider the fohowing stop- 
ping time 

= inf|t : \u,{t)\ =e"^o}. 
Duhamel's principle for writes as 

with 

Ut) = Xe,i + e'-"N+{t). (2.44) 
Hence, the definition of Te implies e"''" = e"e^+'^''-\rje{Te)\, so that 

re = -^(1 -(^o) loge - ^log|^,(f,)|. 



Due to (2.44) and Lemma 40, the distributions of ^ log |T/e(re)| form a tight 
family. Therefore, 



a 



limPiT, > -(1-5^) loge^ = 



A. 



(2.45) 



This fact allows us to use Lemma 37 to estimate up to f^txT^. From (2.39), 
the difference = Ye^i — Ue is given by 

A,(t) = e^+*/" e'^+'Hi{Y,{s),e)ds. 
Jo 



We can use ( 2.45 ) to justify the application of Lemma 37 up to time A r^. 



Then, we combine Theorem 31 , Lemma [37j and the definition of to see 
that 

sup e-^+'\Hi{Y,{t),e)\ < Ki sup ((e-^+*|ye,i(t)|) \Ye,i{t)\ ■ \Y,^2{t)\) + 



t<Tt/\Tt 

Op (e"+"P 
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and 



Op fe--(l-^o) 



These two estimates together with (2.45) imply 



sup |A,(t)| = Op (e"(P+^o)|loge|) . 



On one hand, (2.45) imphes 



fe > -J^ loge j n {fe < TelJ ^ 0. 

On the other hand, if > % then 

\Y,^i{t,)\ = e^-^o + Op(e"(P+^o)| logel 
and 

These relations contradict each other for sufficiently small e if we choose 
60 < p. So, this choice of Sq guarantees that P {fe > t^} — ^ implying the 
result. ■ 



Proof of Lemma 32 Recall that we work on the high probability event 
{fe < T^^}. Hence, for each e > 0, we have the identity 



Solving for and then plugging it back into l^^i, we get 

a 1 
= - — (1 -p) loge - —log |r/+|, (2.46) 
A+ A+ 

y,,i(f,) = e°?'sgn(77+). 

Using this information we are in position to get the asymptotic behavior of 
the random variables rj^. First, from relation (2.39) we get 

f-Te 



Ve =XeA 



+ e-" r e-^+'Hi{Y,{s),e)ds + ei-"iV+(f,). (2.47) 
Jo 



Using (2.46) in (2.40) we get 



1^, 



+|A-/A+ 



(2/2 + e"Xe,2) + |r?+|^-/^+ r e^-'H2{Y,{s), e)ds 

Jo 



(2.48) 
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The main part of the proof is based on representations (2.46)- 



Lemma 41 allows us to use the estimates established in Lemma 



37 



up to 



time ff. In particular, now we can conclude that the family 



e-°sup|y,,2(t) -e-^-*y2| 

t<T, 



(2.49) 



e>0 



is slowly growing thus improving Lemma 38 



To obtain the desired convergence for r/^, we analyze the r.h.s. of (2.47) 



term by term. The convergence of the first term was one of our assumptions. 



For the second one, we need to estimate Hi{Ye,e). Using Lemma 37, the 
boundness of 2 and the definition of f^, we see that 



supe-^+*yA(i)r.,2(t)| = Op(e"+"P). 

t<Tt 



(2.50) 



This estimate and Theorem 31 imply that 



Jo Jo 



YMs)\Y,,2{s)\ds- 



K2 



2-a 



Op(e"P|loge| 



Let us estimate the third term in (2.47). We can use the last estimate along 



with (2.47) and Lemma 40 to conclude that the distributions of positive part 
of At log I r/^ I form a tight family. Therefore, (2.46) implies that 



00, 



0. 



Combined with Ito isometry and Lemma 40 , this implies 

iV+(f,)AiV+, e^O, 



which completes the analysis of VjJ and, due to (2.46), of f^. 



To obtain the convergence of r/^ , we study (2.48). Combining (2.49), 
the inequality 



'2A_f„,2 



\Y,,i{t)\Yl^{t) < 2|y,,i(t)| (^|y,,2(t) - e-^-'y2V + e — 
and the definition of we see that for any q G (0, ap), 

supe^-*|y,,i(t)|y,2^(t) = Op (e^P+^-^e^-"' + e"^) . 
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Hence, as a consequence of Theorem 31 and (2.46) we have 



e^-'H2{Y,{s),e)ds = Op ( ( e"^'-^+"e^-^'= + e"^ ) | log e 



Op (^J^e"{i-(i-p)^-A+)+("P-9) +g"P^ |loge|) . 



Combining this and Lemma 36 in (2.48) we obtain 

rj- = \7]+\^-/^+y2 + Op(e") + Op (^(^e"(i-(i-p)A-A+)+(ap-g) + ^ap^ | iogg|^ 
+ 0p ('ei-"(i-p)-^-A+-9^ 



which finishes the proof of Lemma [32] by choosing q small enough. 



2.6 Proof of Lemma [33 



Consider the solution to system (2.16)-(2.17) equipped with initial condi 



tions (|2.23|) satisfying (|2.24). Let us restrict the analysis to the arbitrary 

{\rit\<K±}, 



high probability event 



for some constants K± > 0. 



Lemma 42 Let p G (0,1) satisfy (2.18), and let {te)e>o be a slowly growing 
family of stopping times. Consider t'^ = t^ A , then for any 7 > 0, 



limP<^sup|y,2(i)| < +7)e°(^-P)^-/^+ } = 1. 
t<ti ' 



Proof. Let 7 > 0. We recall that A^^ is defined in (2.41) and introduce the 
process 

M,{t) = iV-(t) + e / e-^-^'-'^MY,is))ds, (2.51) 
Jo 

where ^'2 was introduced in Theorem 31 , and the stopping time 
/3, = inf {t : |n,2(t)| > (if- +7)e"^'~^^^-/^+} • 

Using the fact that Y^^i is bounded, it is easy to see that there is a constant 
K\_ independent of t, so that for any t < (3^ At'^, we have 

[\'^-'^'-''>\Y,,i{s)\Y^\{s)ds < i^^_eMi-p)A-/A+_ 
Jo 
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This estimate, along with Duhamel's principle and Theorem 31 implies that 
for some constant C > and any t < (3^ At[, 

|i;,2(t)| <e"('-^)^-/^+h7l + i^i [\~^-^'-'^\Y,,i{^)\Y,\{s)ds + esnp\M,{t)\ 

Jo t<p^ 

< e"(l-P)^-A+i^_ + (^g2a{l-p)A_/A+ ^ g \M,{t)\. 



Hence, using Lemma 36 to estimate M^, we obtain that 



P{/3, < = P ^ sup |y,,2(t)| > (K- + I 
converges to as e —t- proving the lemma. ■ 



Lemma 43 Under the assumptions of lemma ^2, for any p G (0, 7 > 0, 
and C > 0, define pe = (— ploge + C) A . Then, we have 



lim P <^ sup \Y, i(t)|e~^+* < (1 + 7)e"P I = L 



Proof. Define the stopping time 

/3, = inf{t:|y,,i(i)|e-^+*>(l + 7)e"^}. 



As a consequence of Duhamel's principle and Theorem 31 we get the bound 



t<l3. 



sup \Ye,i{t)\e-^+* <e''P + Ki [ e-^+%\{s)\Y,,2{s)\ds 
;/3fApe Jo 



This estimate together with Lemma 42 Lemma [36] and the definition of pe 
implies that for any small 5 > we can find a constant K > 0, so that with 
probability bigger than 1 — 5, the inequalities 

sup |y,,i(t)|e-^+* < + Ke"P+"{i-p)A-/A+(^^ ^ + 

< e"P(l + 2i^y9e"(^-P)^-/^+| logel + i^e^^^P), 
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hold for all e > small enough. Hence, for any small enough e > 0, 



P{(3,<P,} = P{ sup |i;,i(t)|e-^+*> (l+7)e"P^ 

< P |Kpe°(i-p)^-/^+ 1 log e\ + Ke^-^'f > 7} + > 
which implies the result. ■ 



The following is an important consequence of Lemma 42 



Corollary 44 With as in (2.25) it holds that 



lim P{rf < rj = 0. 

e-S-O 



In particular, (2.27) holds. 



re}. 



From now on, we restrict our analysis to the high probability event {t^ > 



Let 6+ = e-"Pe-^+^'=y,,i(r,). Then, ([235]) implies 

ap 1 6 

Te = -T— loge + — log— TT, 



(2.52) 



and 



6 sgn 0^ 



Our analysis of these expressions will be based on the next formula which 
directly follows from Duhamel's principle: 

et = sgnr/+ + e-"P H e-^+' Hi{Y,{s), e)ds + e^'^'P {t^). (2.53) 

JO 



The main term in the r.h.s. of (2.53) is sgnr/^. We need to estimate 



the other two terms. Lemma 36 implies that °'^N^{Tf^) converges to in 



probability as e — t- 0. Let us now estimate the integral term. Relations (2.52 ) 



and (2.53) imply that (re)e>o is slowly growing, and we can use Lemma 



42 



to derive 



sup|ye,2(t)| =Op(e"(i-P)^-/^+). 



(2.54) 



We can now use Theorem 1311 to conclude that 



e-"^'sup|/7i(y,(t),e)| = Op(e' 

t<Te 



a(l-p)X-/\+-ap _|_ ^2-ap\ 
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and (2.18) implies that the r.h.s. converges to 0. Therefore, 



-ap 



Jo 



The above analysis of equation ( 2.53| ) implies that if we define = sgn rj^ , 
then 



Law 



'0 ' 



(2.55) 



which implies (2.28) due to (2.52). It remains to prove (2.29). 



Duhamel's principle along with (2.52) yields 

3+1 \ ^-/■^+ 



6 



Ve + 



H2{Y,{s),e)ds+tN:{n). 

(2.56) 



In order to study the convergence of (r^) we first give a preliminary 
result. 



Lemma 45 



sup %,i{t) - e"Pe-^+*sgn7/+| ^0, e ^ 0. 



Proof. The lemma follows from Duhamel's principle and Lemma 43 



The following result is essentially Lemma 8.9 from [3]. It holds true in 



our setting since its proof is based only on the conclusion of Lemma 45 
Lemma 46 As e ^ 0, 

iV-(r,)^^iV, 



where N is the Gaussian random variable in (2.26). 



We finish the proof of Lemma [33| Recall that the process was defined 

cesses 

H2{Y,{s))ds. (2.57) 



in (2.51) and introduce the stochastic processes 

ft 



-X-(t-s) 







Note that ( |2.56D and ( |2.52[ ) imply 



H2iY,{s),e)ds + eN-{n 



JO 

-A_re «(l-p)A_/A+ 



1], + eM,{T,) + R,{t,) 



e"^-/^+ +eMe(re) + i?e(re 



(2.58) 
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Relations (|2.24l) and (|2.55|) imply 



2/2- 



Lemma 46 and estimate (2.54) imply 



(2.59) 



(2.60) 



Equations (2.59) and (2.60) describe the behavior of first two terms in (2.58) 
and the proof of the lemma will be complete as soon as we show that 



e"^i?e(re) -^0, e ^ 0. 



(2.61) 



We can write the following rough estimate based on (2.54) and Theo- 
rem [3T) 

sup \Re{t)\ = Op(e2a(l-p)A-/A+)^ ^2.62) 

This is not sufficient for our purposes. We shall need a more detailed analysis 
instead. First, note that 

sup |y,,2(t) - eM,{t) - Re{t)\e^-' = e°(i-P)^- A+ |r?7 1 = Op(e"(i-P)^-/^+). 

t<T^ 

Hence, for any 7 > there is a > such that the event 

D, = jsup \Y,^2{t) - eM,{t) - Re{t)\e^-^ < K^e"(^-P)^-/^+ 1 

has probability P(-C'e) > 1 — 7 for e > small enough. Moreover, using 
Theorem 31 we see that for some constant Kj3 > 0, 

Jo 

Then, using the inequality (a — 6)^ < 2a^ + 26^ and defining i^/?,^ = K^K^, 
we see that on D^ for each t < t^, 

\Re{t)\ < Kpe-^~' f{e^-'Y,^2{s)fe-^-'ds 
Jo 

[ e~^-'e^''^^-P'^^-'^+ds + 2Kp f e-^~'^'-'^\eM,{s) + R,{s)\^ds 
Jo Jo 



< 2i^/3,T,e 



A- 



^ ^ e,cxD 



/ e^-'R,{sfds, 
Jo 

(2.63) 
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where M^^oo = supj<^^ |Me(t)|, so that (according to Lemma 36) M^^oo is 
slowly growing. Due to ( |2.62 ) we can find a constant K'^ > (independent 
of e > and t > 0) so that the event 

D', = D,n I sup \RS)\ < K;e"(i-P)^-/^+| 

has probability P{D'^) > 1 — 7 for all e > small enough. Hence, multiplying 
both sides of (2.63) by e^~*, we see that for some constant > and all 

t<Te, 

e^-^\R,{t)\lv'^ <a(t) + CT,e"(i-P)^-/^+ [\^''\R,{s)\lv'ds, 

Jo 



where 



a{t) = C7^e2a{l-p)A_/A+ ^ c^^2^2^^ 



(2.64) 



Using Gronwall's lemma and (2.64) we get 



cl 



2^3a(l-p)A_/A+^gC^e-(i-f)^-/^+t 



^':^^2+a(l-p)A_/A+^2^^gA_t+C^.-(i-f)^-/^+_ 

A— ' 



Hence, 



a 



+C7^e^M,^^(l + ^6' 



7 ,a(l-p)A_/A+^gC7^e"(i-f)^-/^+ > 



Using (2.52), we get that for any q > 0, 



Op ( g"^-/^++°(l~P)^-/^+ +e2-g 



so that (2.61 ) follows, and the proof is complete by choosing q small enough. 



Chapter 3 

Levinson Case 



In this chapter we study Levinson case as presented in Section 1.3 of Chap- 
ter [T] We then apply the results obtained for this case to the 1-dimensional 
diffusion conditioned on rare events as explained in Section 1.4.1 of Chap- 
ter [H 

The chapter is organized as follows. In Section 3.2 we state the main 



theorem for the Levinson case, postponing its proof to Section [3^ A approx- 
imation to the diffusion by the deterministic flow in finite time is presented 
in Section 3.3 This approximation is a key ingredient in all the arguments of 
Section 13.41 In Section 13.51 we state the result on the diffusion conditioned 
on a rare event and derive it from the main theorem and some auxiliary 



statements proven in Section 3.5.1 



3.1 Introduction 

In this section we consider the dynamics in d dimensions. That is, we con- 
sider a C^-smooth bounded vector field 6 in M"^. The unperturbed dynamics 
is given by the deterministic flow S = (5*)tgR generated by b. 

The model has slight modifications from the classical exit problem. For 
this chapter, we introduce three components of perturbations of this deter- 
ministic flow. They all depend on a small parameter e > 0. 

The first component is white noise perturbation generated by the matrix 
ecj, where C^-smooth bounded matrix valued function. 

The second one is e^^^'e, where is a deterministic Lipschitz vector 
field on M*^ for each e, converging uniformly to a limiting Lipschitz vector 
field ^'o, and ai is a positive scaling exponent. These conditions ensure that 
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the stochastic Ito equation 

dX,{t) = + e"i^',(X,(t))) dt + eaiX,{t))dW (3.1) 

w.r.t. a standard d-dimensional Wiener process W has a unique strong so- 
lution for any e > and all initial conditions. 

The last component of the perturbation is the initial condition satisfying 

X,(0) = xo + e"2ee, e>0. (3.2) 

Here 02 > 0, and (■^e)e>o is a family of random variables independent of W, 
such that for some random variable — ^ Co as e — )• in distribution. 
Let M be a C^-smooth hypersurface in W^. If 

Te = inf {t > : X,{t) G M} , 

then on {t^ < 00} we have Xe{Te) G M. We are going to study the exit 
problem from M under the assumptions above. We use M instead of D, 
since M is assumed to be an hypersurface and we want to stick to the 
standard notation. In this setting, we state the main theorem in the next 
section. 



3.2 Main result 

In this section we state the main theorem and its hypothesis. Let us start 
with the assumptions on the joint geometry of the vector field b and the 
surface M. First we define 



T = inf {t > : S^xo G M} , 



and assume that < T < 00. Secondly, we denote z = S'^xo G M and 
assume that b{z) does not belong to the tangent hyperplane T^M. In other 
words, we assume that the positive orbit of xq intersects M and the crossing 



is transversal. The reader can check that this is equivalent to Condition 10 
in Section L3 of Chapter [T] 



In the case of = and ^' = 0, Levinson's theorem states (see |46j . 
Chapter 2], and [351 Chapter 2]) that X^ij^) — )■ z in probability as 
e — 7- 0. Levinson worked in the PDE context and showed how to obtain an 
expansion for the solution of the corresponding elliptic PDE depending on 
the small parameter e. The main result of this note describes the limiting 
behavior of the correction (r^ — T,X^{Tf) — z) and extends [311 Theorem 
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2.3] to the situation with generic perturbation parameters ^0,^^,01, and 
a2- This extension is essential since, as the analysis in [4J shows, in the 
sequential study of entrance-exit distributions for multiple domains one has 
to consider nontrivial scaling laws for the initial conditions; also, considering 
nontrivial deterministic perturbations will allow us to study rare events, see 
Section 13.51 

We need more notation. Due to the smoothness of b, 

b{x) = b{y) + Db{y){x-y)+Qi{y,x-y), x,yeR'^, (3.3) 

where 

\Qi{u,v)\<K\v\'', (3.4) 

for some constant K > and any u,v ^ W^. We denote by ^xit) the 
linearization of S along the orbit of x: 

j^^xit) = A{t)Mt), ^■AO)=I, (3.5) 

where A{t) = Db{S*x) and / is the identity matrix. 

Finally, for any vector v G M'^, we define irbv G M and ttmv G T^M by 

V = TTbV ■ b{z) + ttmv, 

i.e., TTfc is the (algebraic) projection onto span(6(z)) along T^M and ttm is 
the (geometric) projection onto T^M along span(6(2;)). 

Theorem 47 Let a = ai A 02 A 1, and 

Mt) = l{a2=a}^xo(i)?0 + l{ar=a}^xo{t) I ^x,,{s)-^-^ o{S' x)ds 

Jo 

+ l{,=^y^,,it) [ '^-^is)a{S-'xo)dWis), t>0. (3.6) 
Jo 

Then, in the setting introduced above, 

e-"(r, - T,X,{n) - z)^{-7TbMT),7^MMT)). (3.7) 

in distribution. If additionally we require that — )• .^o probability or that 
a2 > a, then the convergence in (3.7) is also in probability. 



Remark 1 The conditions of Theorem 47 can be relaxed using the standard 
localization procedure. In fact, one needs to require uniform convergence of 
— 7- ^'o and regularity properties of b and a only in some neighborhood of 
the set {5*xo : < t < T{xo)}. 
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Remark 2 In applications (see [l],[7j), the parameters ai and 02 can be 
chosen so that the r.h.s. of (|3.7|) is nondegenerate. 



Remark 3 In the case where d = 1, the hypersurface M is just a point. 
Therefore, ttm is identical zero and the only contentful information Theo- 



rem 47 provides is the asymptotics of the exit time. 



3.3 A finite time approximation result 

With high probability, at time T the process is close to z and the hitting 



time Te is close to T. The idea behind the proof of Theorem 47 is that while 
the diffusion is close to z, the process may be approximated very well by 
motion with constant velocity b{z). 

In this section we prove the main ingredient to ensure this approximation. 



Lemma 48 Let he the solution of the SDE (3.1) with initial condi- 
tion (3.2). Let 

e,(t) = e°2-"cD,„(t)ee + e"^-°^xoW [' ^.oisr'^o{S'xo)ds 

Jo 

+ ei-"$,„(t) / '^>,,{sy'a{S'xo)dWis). (3.8) 
Jo 

Then, 

X,{t) = S'xo + e''Ut) 

holds almost surely for every t > 0, where ipeit) = Qe{t) + feit), o,nd r^ 
converges to uniformly over compact time intervals in probability. 

If — )• ^0 'iiT' distribution, then for any T > 0, (f)^ ^ (pQ in distribution 
in C[0, T] equipped with uniform norm, where (po is the stochastic process 
defined in (3.6). 

//^e ^0 in probability or 02 > ct, then the uniform convergence for (p^ 
also holds in probability. 

Remark 4 This lemma gives the first-order approximation for ^^(t). Higher- 
order approximations in the spirit of [15] are also possible. They can be used 
to refine Theorem I 



Proof. Let A* = X^(t) — S^xq and note that it satisfies the equation 
dAi = ((6(X,(t)) - b{S'xo)) + e"i^,(X,(t))) dt + ea{X,{t))dW{t), 
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with initial condition = e^^^^. We want to study the properties of this 
equation. We start with the difference in b. Since 6 is a vector field, we 
may write 



b{X,{t)) - b{S'xo) = Db{S'xo)Al + QiiS'xo, A*; 
Also, we can write 

^e{X,{t)) = ^o(5*xo) + Q2{S'xo, A*) + i?,(5*xo 

and 



a{X,{t)) = aiS'xo) + Q-iiS'xo,A 



e/1 



(3.9) 

(3.10) 
(3.11) 



where 



Re{x) = ^e{x)-^0{X)=0{1), € ^ 0, 

uniformly in x; Qi : R"^ x R'^ ^ R'^ , i = 2,3 satisfies 

\Qi{u,v)\ < K\v\, u,v e R'^. 



(3.12) 



We can assume that the constant > in (3.4) and (3.12) is the same for 
simplicity of notation. 

Let Q = Qi + e"iQ2 + e'^^R,. Combine ([3^, (Islol), and Ku\ to get 



dAl = (A(t)A* + e°i^'o(5*xo) + Q(5*xo, A*)) dt 

+ e {a{S'xo) + QsiS'xo, A*)) dW{t), 
A° =e°2^,. 



(3.13) 
(3.14) 



Hence, applying Duhamel's principle to (3.13) and using (3.8), we get 

A* = e"Ge(t) + ^.,{t) f $,.,(s)-iQ(5^xo, Al)ds 
Jo 

+ e^,,{t) I ^^,{sr^Q^{S'xo,Al)dW{s) 
Jo 

= e"Ge(t) + e',(t), (3.15) 

where is defined by ( 3.15| ). A simple inspection of (3.8) shows that 
(0e)^^Q converges in distribution in C(0,T) to the process (/'o(0- This con- 
vergence is in probability if a2 > a or — )■ in probability. Therefore, the 
lemma will follow with 6^ = + e~"0' if we show that 



e-"sup|e;(t)| 0, e^O. 

t<T 



(3.16) 
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For any 6 G (1/2, 1), we introduce the stopping time 
le{6) = inf |t > : |A*| > e""^} . 
Now, e', = e;_i + eG;_2> where 

Jo 



and 



Jo 



Bounds (3.4), and (3.12) imply 



sup I e; 1 (t) I = 0(e2"'5 + e"i+"'^) + o(e"^ ) = o(e^ 

t<TAh{S) 



(3.17) 



Likewise, (3.12) for Q3 and BDG inequahty imply that for any k > there 



is a constant such that 



p<^ sup |e',2(t)| >i^«ei+°n <K 

\t<Thh{5) ' I 



(3.18) 



for all e > small enough. Then, this together with (3.17) imply that 



e-"'' sup |e;(t)|Ao, e^O. 

t<T/\hi5) 



(3.19) 



Then, if le{6) < T we use (3.15) to get 
1 = e""^ sup I A* I 



t<TAh{5) 

<e"(^-'') sup \e,{t)\ + e-""^ sup |e',(t)|. 



t<TAh{5) 



t<TAlc{S) 



The r.h.s. converges to in probability due to (3.19) and the tightness of 



distributions of O^. Hence, < T} — )• as e — )• 0. Using T instead 

of T A /e((5) in (3.17) and ( 3.18| ), we see that with the choice of 5 > 1/2, 
(3.16) follows and the proof is finished. ■ 
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3.4 Proof of Theorem [47 



As we said before, the core idea of the proof is to approximate the behavior of 
the process with that of the deterministic flow in a small neighborhood 
of z. Let us start analyzing the process X^{t) — z for t close to T. Let 
us first estimate the deviation of the flow S from the motion with costant 
velocity b{z). Let 

r±{t, x) = S^^x -{x± th{z)) , t>0, xeR'^. (3.20) 



Lemma 49 There are constants Ci and C2 so that for any t > and x G M'^ 
sup|r±(s,x)| < Cie'^^\t\x - z\ +t^). 

s<t 

Proof. We prove the result for r+. The analysis of r_ is similar since 5'~*x 
is the solution to the ODE 

^S-'x = -b{S-'x). 
dt 

Let L > be the Lipschitz constant of b. The proof follows from the 
inequalities: 

|r+(t,x)| < / \b{S'x) -b{z)\ds 
Jo 

<L [ \S'x-z\ds 
Jo 

<L / \r^{s, x)\ ds + L / \x + sb{z) — z\ ds 
Jo Jo 

<L / |r+(s, a;)| ds + L / \x — z\ds + L / s\b{z)\ds 
Jo Jo Jo 

<L |r+(s, x)\ds + Lt\x - z\ + t"^ L\b{z)\/2. 
Jo 

The result follows as an application of Gronwall's lemma. ■ 

Lemma 50 Let 7 G (a/2, a). Then, there are two a. s. -continuous stochas- 
tic processes Te^± such that 

sup |r,,±(t)| Ao, e^O, 

ie[0,eT] 
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and almost surely for any t E [0, e"^] 

X,{T -t) = z- tb{z) + (0,(r -t) + r,,_(t)) 

and 

X,{T + t) = z + tb{z) + e° ($.(t)0.(r) + r,,+ (t)) . 



(3.21) 
(3.22) 



Proof. Due to Lemma |48| the flow property, and (3.20) we have 

X,{T -t) = S^-'xo + e"0,(r - t) 
= S-^z + e^^.iT -t) 
= z- tb{z) + r_(t, z) + e"(/),(r - t). 



The first estimate with T^-(t) = e^"r^(t, z) fohows from Lemma 49 for 
X = z. 

Due to Strong Markov property and Lemma 47 the process Xe{t) = 
X^{t + T) is a solution of the initial value problem 

dX,{t) = {b{X,{t)) + e"i^,(X,(t)))(it + ea{X,{t))dW, 
XM = Xe{T) = z + e^UT), 

with respect to the Brownian Motion W{t) = W{t + T) — W{T). So, again, 
applying Lemma 47 to this shifted equation, we obtain Xeit) = S^z+e^4>e{t), 
where, for t > 



4(t) = «'.(t)0e(T) + 0,(t), 



and 



Jo Jo 

where converges to uniformly over compact time intervals in probability. 
Then due to ( |3.20D , 

X,{t) = S'z + e''{^,{t)Mt) + Oe{t)) 

= z + tb{z) + r+(t, z) + e°($,(t)(/.e(t) + eS))- 

Hence, with T^^j^{t) = 9e{t) + e~"r+(t, z) the result is a consequence of 
Lemma [49l ■ 



Let us now parametrize, locally around z, the hypersurface M as a graph 
of a C'^-function F over T^M, i.e., y i— t- z + y + F{y) ■ b{z) gives a C^- 
parametrization of a neighborhood of z in M by a neighborhood of in 
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T^M. Moreover, DF{0) = so that \F{y)\ = 0{\y\'^), y ^ 0. With this 
definition, it is clear that, for w £ M.'^ with w — z small enough, w G M if 
and only if 7rh{w — z) = F{-KMiw — z)). 
Let us define 

VLi,, = {r, = inf{t > : n {X,{t) - z) = F (vTAf {X,{t) - z))}], 
VL2,, = {\n-T\<e'}, 

0.^ = 9.1^ r\Vt2e- 



Lemma 51 P(Oe) — )• 1 as e 



0. 



Proof. The definition of F and Lemma 47 imply that as e — )• 0, P(r2i^e) 1- 
We use (3.22) to conclude that 

TTfc (X,(r + 6^) -z) = e^l + e°-%fe (cl>,(e^)0,(r) + r,,+(e^))) , 

and 

F {ttm {X,{T + e^) -z)) = F (e^vTAf {^,{e^)UT) + r.,+ (e^))) • 
Since = 0(|a:p), these estimates imply that 

limsupP ({re > T + e^} n Jli,,) 

< limsupP {^fe {X,{T + e^)-z)<F {-km {X,{T + e^) - z))} = 0. 



It remains to prove 



lim P {r, < T - e^} = 0. 

e-S>0 



(3.23) 



Let us denote the Hausdorff distance between sets by d{-, •). Then an obvious 
estimate 

d{{S^xo :0<t<T-6},M)>c6 



holds true for some c > and all sufficiently small 5 > 0. Now (3.23) follows 
from Lemma 47 and the proof is complete ■ 

Lemma 52 Define r^' = t^ — T . Then, 



0, 



0. 
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Proof. Let us define = {0 < r,' < e'^} n Jli,, and B, = {-e^ < r,' < 0} n 



^i,e, SO that = AeU B^. We can use (3.22) and the definition of Vti^e to 
get 

IaA + Ue^vr, ($.(r;),/.,(r) + r,,+ (r,')) = l^.F (e^vr^ {^^{T'Me{T) + r,,+ (T,'))) . 
This imphes 

-Uvrfe(cI>,(r;)0,(r) + r,,+ (rO) 
= -U7r,($,(r;)0,(r))+r,,i 

= -U7r60,(r) + UvTfe ((/ - $,(r,'))<Ae(r)) + r,,i, (3.24) 

where r^^i is a random variable that converges to in probability as e — )■ 0. 

Likewise, since = T — (— r^) and Is^'^'e ^ 0, we can use (3.21 ) and the 
definition of to see that 

IsA+^B^e^^b {UT + <) + T,,-{-<)) = ^bF (e" {UT + <) + r,,_(-r,'))) . 

Hence, proceeding as before, we see that 

lB,e""r,' = -lB,7rb0,(T + r,') + r,,2 

= -lB,vrb</.,(r) + iB^n (0e(r) - 0,(r + r,')) + re,2 

for some random variable r^^2 such that r^^2 — >• in probability as e — )• 0. 



Adding this identity and (3.24), we see that on Q, 



= -TiMT)+U^'^h {{I - $.(r;))0,(r))+lB^7rfe (0,(r) - </),(r + r^))+r,,i+r,,2. 
Due to Lemma [STj to finish the proof it is sufficient to notice that as e — )• 



and 



sup \{I-<!>,{t))UT)\ 

0<t<e-< 



sup |</.,(T) -(/>,(r + t)| 

o<t<e-y 



0. 



(3.25) 
(3.26) 



Lemma 52 takes care of the time component in Theorem 47 We shall 
consider the spatial component now. 



Let A^ and B^ be as in the proof of Lemma [52} Then, (3.22) implies 



= U (e-"ri6(z) + 0,(T)) + lA, [(a>,(r,') - I)UT) + r,,+ (r,')] (3.27) 
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Likewise, from (3.21) we get that 



1b, (X,(r,) - z) e-" = lB,e-"r,'6(z) + 1b, {^T + t'J + r,,_(-r,')) 

= 1b, {e-'^rXz) + UT)) + 1b, [{MT + <) - UT)) + r.^-O] • 

(3.28) 



Adding (3.27) and (3.28) and preceding as in the proof of Lemma 52 
that 

(Xe(re) - z) - ttmMT) = (e'^T^ + TTtMT)) b{z) + p,, 



we see 



where, due to (3.25), (3.26) and Lemma 50l — )• in probabiHty as e — )• 0. 



From this expression and Lemma 52 we get that 



(X,(r,)-z)e-"-7rM0,(r) -^->0, e ^ 0. 

Then, using this and the convergence in Lemma [52] 

e-°(T, - T, X,(r,) -z) = R, + G{UT)), 

where Re is a random variable such that i?^ — t- in probabihty as e — t- 0. 
G is the continuous function x i-)- {—nbX,TTMx)- Hence, Theorem 47 follows 
from the convergence in Lemma [47} 



3.5 Conditioned diffusions in 1 dimension 



In this section we apply Theorem 47 to the analysis of the exit time of 
conditioned diffusions in 1-dimensional situation for the large deviation case. 
Suppose, for each e > 0, is a weak solution of the following SDE: 

dX,{t) = h{X,{t))dt + e(T{Xe{t))dW{t), 

where h and a are functions on R, such that h{x) < and (t{x) ^ for 
all X in an interval [01,02] containing xq. We introduce 

Te = inf{t > : X^{t) = ai or 02} 

and Bg: = {X^{Tf^) = 02}. Since 6 < 0, i?^ is a rare event since lim^^o P(-Se) = 
0. More precise estimates on the asymptotic behavior of P{B^) can be ob- 
tained in terms of large deviations. However, here we study the diffusion X^ 
conditioned on the rare event B.. 
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Let T{xq) denote the time it takes for the solution of x = —b{x) starting 
at xo to reach a2- Given that 6 < on the hole interval [01,02], a simple 
calculation shows that 



Tixo) 



0.2 1 



Xo 



-dx. 



Theorem 53 Conditioned on B^, the distribution of e ^(t^ — T(xo)) con- 
verges weakly to a centered Gaussian distribution with variance 



«2 „2 



dy. 



To prove this theorem, we will need two auxiliary statements. Their 



proofs are given in Section 3.5.1 



Lemma 54 Conditioned on B^, the process is a diffusion with the same 
diffusion coefficient as the unconditioned process, and with the drift coeffi- 
cient given by 



b,{x) = 6(2;) +eV2(x) 



where 



h^{x) = exp 



XTi h,{y)dy 

Hy) 



dy}. 



(3.29) 



Further analysis requires understanding the limiting behavior of be . This 
is the purpose of the next lemma: 

Lemma 55 There is 6 > such that 



limsupe ^ sup \be{x) + b{x)\ < 00. 

e-s-O \xe[xo-5,a2+S] J 

Remark 5 Although we need the condition that b{x) < for all x G [oi, 02] 



for Theorem 53 to hold. Lemmas 54 and 55 hold independently of the sign 
properties of b. 



Proof of Theorem 53, Let us fix /3 G (1,2). Lemmas 54 and 55 imply 
that Xe conditioned on B^, up to satisfies an SDE of the form 



dXeit) = (-6(X,(t)) + e%,,;3(^e(t))) dt + eaiXeit))dWit), 
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for some Brownian Motion W and with ^e,i3 ~^ uniformly as e — )• 0. We 
can assume that after time Te, this process stih follows the same equation at 
least up to the time it hits xq — 5 ov ai + 5. 

So, having the dynamics from x = —b{x) as the underperturbed dy- 
namics, we can apply Theorem 47 (taking into account Remark [T]) to see 
that 

p 1 /■'^(^"^ 1 

e-i(T,-r(xo)) -— -<I>.o(r(xo)) / '^^„HsMS'xo)dW{s), e ^ 0, 



(3.30) 

where S^xq is the flow generated by the vector field —6, the time T{xo) 
solves /S^^^^^xq = a2, and is the linearization of 5 near the orbit of xq. 
The limit is clearly a centered Gaussian random variable. To compute its 
variance we must first solve 

|$,„(i) = -6'(5*xo)$,oW, ^.o(0) = l- 
The solution to this linear ODE is 

$,„(t) = exp|-^ b'{S'xo)dsY 
so that after the change of variables u = S'^xq in the integral, we get 

Using this expression and ltd isometry for the limiting random variable 
in (3.30), we get that the variance of such random variable is 

r-T(xo) a^S'xo) 



b'^iS'xo 



-dt. 



We can now use the change of variable u = S^xq to get the expression in 
Theorem [ 



3.5.1 Proof of Lemmas 



and 



Proof of Lemma 54, Let us find the generator of the conditioned dif- 
fusion. To that end we denote the generator of the original diffusion by 



LJ{x) = b{x)f'{x) + — (T^(x)/"(x) = lim 
2 t->0 



E,/(X,) - f{x) 



(3.31) 
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where / is any bounded C^-function with bounded first two derivatives and 
E3; denotes expectation with respect to the measure Px, the element of the 
Markov family describing the Markov process emitted from point x. 

Let us denote u^^x) = PxiBe)- This function solves the following boundary- 
value problem for the backward Kolmogorov equation: 

L^Ue{x) = 0, u^{ai) = 0, Ue[a2) = 1. 

A, a unique S' 

XTi h,{y)dy 



Using (3.31), it is easy to check that a unique solution is given by 

Ue{x) 



where he is defined in (3.29). 

Now we can compute the generator of the conditioned flow. For any 
smooth and bounded function f £ C"^ with bounded flrst two derivatives, 
we can write 



where 



\Re 



[X 



= u;\x)EJ{Xe{t))lB, 

= U:\x)Exf{Xe{t))lBA{r.>t} + Re 

= U;\x)ExE,[f{Xeit))lB,l{r.>t}\:Ft] + Re 

= U;\x)EJ{Xe{t))Px,(t){Be)+Re 
= U-\x)E^f{Xe(t))Ue{Xe{t)) + Re, 

\E,f{Xe)lBA{r,<t}\ < C{x)F{Te < t} = o{t) 



for some C{x) > 0. Therefore, we obtain 

E.,[f{XemBe]-f{x) 



Lefix) 



lim 

lim 
1 



U-\x)EJ{Xeit))Ue{Xe{t)) - f {x) 



lim 



E^f{Xe{t))Ue{Xe{t)) - f {x)Ue{x) 



Ue{x) t^O t 

'6(:r)+.V(x)^) f'{x)+e'^r 



b{x) + e^a^(x 



hejx) 
la, he{y)dy^ 



f'/ \ I 2^'^(-^) till \ 

f ix) + e -^^f [x) 
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completing the proof. ■ 

Proof of Lemma |55[ The proof is a variation of Laplace's method. Let 

= 2 / -if\dy, X > ai, (3.32) 

so that he{x) = e"*^^)/*^^. We take any /3 G (1,2) and break the integral of 
in two parts: 

r e-'^(y)/^'dy = h,,{x) + h,2{x), 



lai 

where 



Ie,iix) = I e-^^y^/^'dy, (3.33) 



and 



/.,2(x) = / e-^'^y)^^ dy. (3.34) 

The idea is to prove that I^^i is exponentially smaller than I^^2 and then 
estimate Ie^2- 

We start with some preliminaries for the function <I>. Since both b and 
a are and o" 7^ in [ai, 02] we conclude that $ is a function so that 
we can find a function i? : M x M — t- M and a number 6q > such that for 
every x,y G [ai, 02 + 60], we have the expansion 

$(y) = $(x) + ^>'(a;)(y - x) + R{x, y - x), (3.35) 

and 

\R{x,v)\ < Ki\v\^, X e[ai,a2 + do],v eR, (3.36) 

for some Ki > 0. 

To estimate Ie,i, we introduce 

Je,l(x) = r, r,, . Ie,i{x), X € [ai,a2 + So]. 

Since <I> is decreasing, we have that for some constant K2 > independent 
of x G [ai , 02 + (Jo] ) 



J. 



,(3;) < ^e(*(x)-$(-e'^))A^ (3.37) 



e2 



Since (3 < 2 and is negative and bounded away from zero, we conclude 
that there is a(e) such that a(e) = o{e'^) as e — )• and 

sup Je,iix) < a{e). (3.38) 

a;e[ai,a2+<5o] 
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We now estimate 1^,2- Using expansion (3.35) and the change of variables 
u = — — x)/e^, we get 



Ie,2ix) = e 



-${x)/e2 / ^-<i>'(x)(y-x)/e2~R{x,y-x)/e^ 



dy 



^2 , rO 



2h{x) ^ 



^u-R{x,-e-'u/^'{x))/e'^^ 



(3.39) 



where we use ( 3.32[ ) to compute the derivative of <I>, and we define Je,2 
by (3.39). Hence, combining (3.37) with the definition of and (3.39), we 

get 

1 

6,(x) = h{x) + 



Due to (3.38), the proof will be complete once we prove that for sufficiently 



small (5 > 0, 



limsupe ^ sup |Je,2(2;) — 1| < cxd. 

e-^O \xe[xo-5,a2+&] ' J 

Note that for any 6 G (0,xo — ai), some constant = K3{6) > and 
all X G [xq — <5, a2 + S], 





\Je,2{^)-l\= [ 



(x)/e2-/3 
$'{a;)/e2-/3 

+ / e"du 



'$'(z)/e2-/3 



Using (3.36) we see that for some constant > independent of x G 



[xo — 6,a2 + S] and u G M, 

\R{x,-e'^u/^'{x))\/e^ < K^e^u^. 

In particular, 

sup sup \R{x,-e'^u/^'{x))\/e^ < K4e'^'^'^-^\ 

xe[xo-S,a2+5] ue[$'(z)/e2-/3^o] 



CHAPTERS. LEVINSON CASE 



80 



Since /? > 1, the r.h.s. converges to and we can apply a basic Taylor 
estimate which implies that for all e > small enough, 



sup sup 

3;e[xo-<5,a2+<5] ue[<I''(a;)/e2-/5,0] 



for some K5 > 0. Using this fact in the integral of (3.40), we can find a 
constant Kq = Kq{5) > such that 

sup \J,,2{x) - 1| < Kae'^ + e-^^l'^~\ 
which finishes the proof. ■ 



Chapter 4 

Conclusion 



This chapter is devoted to give further discussion of the topics covered in 



this text. In Section 4.1 we made some comments related to the apphcation 



of normal forms. In Section 4.2 we comment about the exit problem in the 
case where the deterministic flow has a unique saddle point. In Section [4.3| 
we present an open problem related to scaling limit and show a possible 
relation with Chapter [3} 

4.1 Normal Forms 



In this thesis, a transformation (normal form transformation) is used 
to conjugate the original equation for into a non- linear perturbation of 
the linearized equation. This is done so we can avoid an approximation 
step between our original equation and its linearization. There is evidence 
that a similar methodology has been in the mind of the researchers since 
the publication of |L22j. A concrete conjugation of the original equation into 
the linearized system was used in [3]. Although this result was successful, 
it required certain assumptions that are removed in this work (in the 2- 
dimensional setting) by conjugating to a non-linear system instead. As far 
as the author knows, it is the first time a program of this nature has been 
successful. 

Inspired by [13] , in [30] a normal form transformation was applied to an 
epidemiological model. Contrastingly to our case, this is a specific equation 
and not an abstract setting. In [3j the normal form theory is presented 
for stochastic differential equations in the abstract setting. Although the 
transformation is presented, no estimates are computed as in this work. 
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Further the development of normal form theory in [3] is not complete. For 
example, it excludes the one-resonant case. 

Although normal form theory has proved to be a powerful tool in dy- 
namical systems, in probability is still not clear how powerful the theory 
really is. In this text we use the very explicit shape of the nonlinearity in 
the normal form to obtain specific estimates that successfully lead to a com- 
plete solution of the problem in Chapter [2] As far as the author knows, this 
work is the first time in which normal form theory is applied in an abstract 
setting and is used to obtain tight estimates that lead to a solution of a 
probabilistic problem. The approach presented has some further generaliza- 
tions in which the application of normal forms may be useful. We give a 
brief presentation about the possible complications that may be found. 



4.2 Escape from a Saddle: further generalizations. 

In this work we have studied the exit problem for small noise diffusions. In 
particular, we have shown the existence of possible asymmetries in the case 
in which the fiow generated by the drift admits a saddle point. The proof is 
restricted to the 2-dimensional setting. Let us discuss about this particular 
restriction. 

Our method of proof was to transform the original equation into a very 
specific non-linear equation known as normal form. Then, we obtained sev- 
eral estimates that intensively uses the smallness of the noise and the specific 
form of the nonlinearity in the normal form. Let us recall the form of the 
nonlinearity. 

In our case, the nonlinearity i n th e normal form is given by a finite sum of 
resonant monomials (see Section 



2.3 



of the form ( ^ ), where 



{a^ ,02)^^ satisfy the resonance relations 

a^A+ — a^X- = ±A-|-, 

of some order r = af + > 2. If we were to generalize the argument in 
Chapter [2] to the d-dimensional case, we would need to take into account 
the particular form that the nonlinearity would have in the normal form. 
Indeed, there are two points to consider: 

1. The resonant monomials of order r > 2 are of the form 

^"l^'* ^"^d.,! rr'^d.d-s 
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where the vector = (oj^i, 0,^^) S Z'^ satisfies 

Oi.iAi + ... + ai,dArf = Aj 
ai,i + ... + tti^d = r, 

for each i = 1, ...,d. Here Ai, A(i are the eigenvalues of the matrix 
V6(0). 

2. According to |38l Theorem 3, Section 2], the nonhnearity A^, after 
being transformed by a normal form transformation of degree R > 1, 
will be of the form 

N{x) = P{x) + Q{x), 

where P is a finite sum of resonant monomials, and Q is a correction of 
order jxl'^"^^ (as |a;| — )• 0) when the vector of eigenvalues A = (Ai, A^) 
is not one-resonant and identically when A is one-resonant. 

The first point implies that to obtain the exponents aij more combina- 



torial work than the one put in Section 2.3.2 is needed. Still this is not the 
biggest difficulty. The biggest difficulty relies on the lack of structure of the 
correction Q in the case A is not one-resonant. Indeed, to have any hope that 
our techniques in Chapter [2] work, we require at least that whenever ai^k 7^ 
for some k < i, then aij ^ for some k < j < d (this is in the case we order 
the eigenvalues as usual: ReAi > ... > ReAf^). There is no guarantee that a 
condition of this form holds in the not one-resonant case. In conclusion, a 
higher dimensional analogue for the saddle case can be obtained using the 
techniques presented in this theses only in the one-resonant case. This is 
so, unless the particular structure that the eigenvalues Ai, A^ have in this 
case implies that a normal form transformation can be chosen so that the 
non-linearity of the transformed drift is a finite sum of resonant monomials 
with no correction. As far as the author knows, this is an unsolved issue in 
normal form theory. 

There are still some results to be filled in order to complete the case in 
which the deterministic flow has a saddle, and hence the case in which it 
admits an heteroclinic network. This result is worthwhile pursuing since the 
implications of the asymmetry found in Chapter [2] has very interesting ana- 
logues in higher dimensions as famous chaotic systems (such as the Lorentz 
system) in higher dimensions exhibit homoclinic behavior (see \64\ Chapters 
27,30 and 31] for further examples). 
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4.2.1 A non-smooth transformation alternative 



As discussed in Section 2.3.2 , it is possible to conjugate a nonlinear equation 
to a linear one. The restriction for Ito equations is that this transformation 
has to be at least C^. Recent results have extended Ito's formula for func- 
tions with less smoothness. The first result of this nature is the well known 
Tanaka's formula |54l Chapter IV], which relies on the existence of local time 
for one dimensional semimartingales to extend the range of applicability of 
Ito's formula to convex functions. For higher dimensional semimartingales 
there is no local time, so there was no immediate high dimensional analogue 
for Tanaka's formula. For a long time Tanaka's formula remain the more 
general change of variables (in terms of smoothness requirements) known. 
Recent studies have established change of variables for higher dimensional 
semimartingales with less smoothness |57j . [58], [29], |26j . Let us give a 
brief (and informal) comment about this setting. 

Consider / : M'^ — )• M to be a continuously differentiable function. Let Z 
be a semimartingale in M*^ with Z{t) = V{t) + M{t), M being a martingale 
and V a stochastic process with bounded variation paths. Then [57J, [58j, 
[29] . [26] agree that if the quadratic covariation [f{Z),Z^] is well defined 



for every 1 < j < d, then Ito's formula holds: 

rt 



f{Z{t)) 



f{Zm + I V/(Z(.))dZ(.) + ^ ^[9,,/(Z),Z^](t). 



We recall the definition of quadratic covariation (see [Ml Section V.5]): 

Definition 56 Let H and J be two continuous stochastic processes in M. 
The quadratic covariation of H and J, denoted as [H, J] , is, when it exist, 
the continuous process of finite variation over compacts, such that for any 
sequence an of random partitions tending to the identity, 

[H, J] = H{0)J{0) + jim_ j;(//^"+^ - i/^")(/7^"+^ - H^"), (4.1) 



n— >oo 



uniformly over compacts in probability. Here, for any random S > 0, the 
process H^ is short for 1 1— )• H{t A S), and an is the sequence = Tq < ... < 
TJ}^, where supj(T-'^-|^ — T") — )• 0, /c„ — )• oo, and, T^^ — )• oo as n ^ oo. 

For our diffusion process X^, there are several problems to consider. One 
is to show that [dxjf{X^),Xl] is well defined. The other, is to prove that 

e-Ma-/(X,),X:] Ao, e^O. (4.2) 
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Once this is established, the result in Chapter [2] follows immediately. 

In order to show that [dxjf{Xe), XI] is well defined, the proposal in [57] , [58] 
[29] is to use the theory of reversible diffusions proposed in [50J. Indeed, as- 
sume for a moment that we know that, for a fixed T > 0, X^{t) = X^(T — t) 



is a diffusion. Then, observe that (4.1) for H = dxjf{X^) and J = XI can 
be written as 

[dxJ{X,),Xl]{t) = - f dxJ{X,{s))dXi{s)- r dxJ{X{s))dXi{s), 

Jo JT-t 

(4.3) 

where both integrals are Ito integrals with respect to different filtrations. In 



order to use this formula to prove (4.2) the first attempt may be to get the 



generator of X^. Under several assumptions (the most important one being 
the ellipticity of the noise) in [50j it is proved that, for a fixed time T > 0, 
X{T — •) is also a diffusion with the same diffusion matrix and with drift 
b = — b{x) + V log pT^t,T{x, X^{T)), where pT-t,T is the transition density of 
the Markov process X^ (which existence is proved in [SO]). Hence in order to 



establish (4.2 ) we first would need to have a bound in e of V logpT-t,T- This 
quantity is of interest in control theory [28j, but there is, as far as the author 
knows, no reference to an estimate in e > 0. Another option that avoids this 
estimate is to extend the filtration to the minimum complete filtration 
that includes J-^ such that X^{T) is measurable and write Doob-Meyer 
decomposition for the process X^ with respect to this filtration. 

This is still undergoing work, that is promising not only because it allows 
to prove the results included in this thesis, but also because it uses several 
tools of modern stochastic analysis. 

4.3 Scaling limits 

In this thesis we proved a scaling limit for the exit problem for two cases, 
the case in which the flow S has a unique saddle and the Levinson case. The 
idea will be to prove scaling limits for more general systems. In particular, 
recall that if the quasipotential has a unique minimizer z, then the exit point 
X^^Te) converges to it in probability as e — t- 0. By a scaling limit, we mean 
find an a > such that the distribution of e~°'{z — X^{t^)) is tight. 



Let V : D y. dD — ^ [0, oo) be the quasipotential given by (1.7): 



y(x, y) = mf {/f ((^) : (^(T) = y, (^([0, T]) C D U dD} 

In order to state the claim, given xq G -D, let Aixo C dD be the set of 
minimizers of y i— )• V{xQ^y). The claim is the following: 
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Claim 57 Suppose A4xq is finite M-xq = {ei, e^}. There is a probability 
distribution v over M, a random number a G (0, 1] and a family of random 
variables {£,e)e>o such that the exit can be written as 

X^{t,) = Uiei + ... + UqCq + e°^,. 

Further, there is a random variable so that — )• distribution as 

e -)■ 0. 



The results of this thesis imply this claim in the case the flow S admits 



an heteroclinic network (see Section 1.4.2). The proof was done by solving 



two simple cases (saddle point and Levinson case) and then using a Poincare 
distributional map argument for each critical point in the network. 

Here we shall proceed similarly: start from simple cases with random 
initial conditions so that a Poincare argument can be applied. The proposal 
is to choose as the base case the well developed stable case [3l]: £ D and 
D is contained in the basin of attraction of 0. It is known that if the domain 
D is attracted to the origin and Mxq = {e} then Xe{Tf_) — )■ e in probability. 
Moreover, if there is a unique extreme trajectory ipo (the one that realizes 
the minimum in V) then for every 5 > 0, 

limP^J sup \X,{t) - Mi - 9. + Oo)\ < 5} = 0, 
^->o [e,<t<r, J 

where 9^ (Oq) is the last time ((/?o) hits a ball of arbitrary small (but 
fixed) radius around the origin. From the perspective introduced in Chap- 
ter [3j consider the process conditioned on exit close to e. This process is 
a semimartingale with the same diffusion matrix as the original process, 
but with a drift of the form b^ = —b{x) + e^(/?e(t,x), where 99^ is uniformly 
bounded. Hence, our results in the Levinson case of Chapter [3] apply. 

Once this result is established, we can follow the same pattern as in [34] 
to study possible asymmetric behavior in metastable process. With this 
development we can show that the idea of random Poincare maps apply to 
a general dynamical system. 



Appendix A 

Large Deviations 



Large deviation theory is a mixture of probability theory, analysis, varia- 
tional calculus, point set topology among others. This theory has been used 
for different purposes. In Section LI we discussed the role played by large 
deviation theory in the development of Freidlin-Wentzell theory. The pur- 
pose of this chapter is to provide a quick reference to large deviation theory 
as needed to understand Section 11.11 

We present the general theory of large deviations. The theory was first 
formulated in the right degree of abstraction by Varadhan |6T] , we follow [52] 
in this exposition. In Section A.l we begin with the basic definitions. In 
Section A. 2 we present the large deviation results related to diffusion pro- 
cesses. 



A.l Large Deviations Principle (LDP) 

Let X he a Polish metric space with metric function d : X x X ^ [0, oo). By 
a probability measure on X, we mean a probability measure on the Borel 
sigma algebra on X. We will give the general definition of large deviation 
principle for a family of probability measures on X. First, recall the following 
definition. 

Definition 58 The function f : X ^ [—00,00] is lower semi- continuous if 
it satisfies any of the following equivalent properties: 

1. liminf^^oo fixn) ^ f{x) for all sequences {xn)ne'M C X and all points 
X £ X such that Xn ^ x in X . 

2. For all x £ X, \uns^QUiiy^Bs{x) fiv) = f{x), where Bs{x) = {y e X : 
d{x,y) < 6}. 
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3. f has closed level sets, that is, f ^^([—00,0]) = {x £ X : f{x) < c} is 
closed for all c G M. 

Here are the key definitions of large deviation theory: 

Definition 59 The function / : — )• [0, 00] is called a rate function if 

1. 1^00, 

2. I is lower semi- continuous, 

3. I has compact level sets. 

Definition 60 A family of probability measures (P)e>o on X is said to 
satisfy , as e ^ 0,the large deviation principle (LDP) with rate — t- and 
rate function I if 

1. I is a rate function, 

2. limsup^_j.Q logPe(C) < —I{C), for every C C X closed, 

3. liminfe_5.o logPe(O) > —I{0), for every O C X open. 
Here the bounds are in terms of the set function defined by 

I(S) = mfI(x), ScX. 

The goal of large deviation theory is to build up an arsenal of theorems 
based on these two definitions. We will not describe most of this theorems, 
since they are out of the scope for the present text. The interested reader 
is invited to consult the standard monographs on the subject j23l Chapter 
4], [24:1 Chapter III], |52^ Chapter 2]. The only theorem that we cite is the 
so called contraction principle. First, we give some remarks 

Remark 6 L It is a standard exercise to show that once the large devi- 
ation principle is satisfied, the rate function I is unique. 

2. In Definition [6^ it is crucial to make a difference between open and 
closed sets. Naively, one might try to replace the second and third 
conditions with the stronger requirement that 

lima,PJS) = -US), ScX. 

However, there are examples that show that this would be far too re- 
strictive. 
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We now present the contraction principle: 

Theorem 61 Let (P)e>o be a family of probability measures on X that sat- 
isfies the LDP, as e — 7- 0, with rate function and with rate function I. 
Let y be a Polish space, T : X ^ y a continuous map, and = o T"-*^ 
an image probability measure. Then, the family {Qe)t>Q satisfies the LDP 
on y with rate and with rate function J given by 

J(y) = inf lix), 
with the convention inf0 I = oo. 



A.2 Freidlin-Wentzell LDP 

In this section we present the large deviation results that Preidlin-Wentzell 
theory is based on. 

Given T > 0, let W{t),t G [0,r], be a standard Brownian motion in 
W^. Consider the process We{t) = eW{t), and let be the probability 
measure induced by on C([0, T]; M*^), the space of all continuous functions 
If : [0, T] —7- equipped with the supremum norm topology. We first state 
the LDP for We derived by Schilder [59]: 

Theorem 62 The family of probability measures (P^)e>o on C([0,T];R'^) 
satisfy a LDP with rate and with rate function 



M<P) 



oo , otherwise 



Here is the space of absolutely continuous functions with square integrable 
derivative. 

The simple case in which the process X^ is the strong solution of 
dXe{t) = b{X,{t))dt + edW{t) 



is a consequence of Theorem [62] and the Contraction Principle [61] Indeed, 
let F : C([0,r];M°') ^ C([0,T];M'^) be the map defined by / = F{g), where 
/ is the unique solution of 

f{t)= rb{f{s))ds + g{t). 



Then, after noticing that F is continuous and some calculation. Theorem 62 
implies the following result: 
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Corollary 63 The law of on C{[0,T];R'^) satisfies a LDP with rate 
and with rate function 

J/ =1 Uo\'Pi^)-bis)\^ds , cPGH' 
\ oo , otherwise 

Now, consider to be the solution of our typical SDE 

dX,{t) = h{X,{t))dt + ea{X,Xt))dW{t). 



As said on Section 1.1a LDP for this process is the base of Freidlin-Wentzell 
theory. It turns out that to obtain a LDP for the law of X^ the contraction 
principle does not apply. Instead, raw approximations have to be made. We 
state the theorem without a proof (see j23l Section 4.2] or [311 chapter 3] 
for a proof). 

Theorem 64 (Preidlin- Wentzell |34| ) Let Hq j, be the space of all ab- 
solutely continuous functions from [0, T] to with square integrable deriva- 
tives. Define the functional I^ by 

= I is) - b{^{s)),a-\^{s)){V is) - bi^isMds, (A.l) 



if if £ H^rp and 9^(0) = x, and oo otherwise. Here b is the drift in (1.1) and 
a = a'^a, with a the diffusion matrix in (1.1). 

Then for each x S M*^ and T > the family (P|)e>o satisfies a Large 
Deviation Principle on C{[0,T];W^) equipped with uniform norm at rate 
with good rate function I^. 



Appendix B 



Appendix to Section 1.2.1 



The purpose of this appendix is to present any technical material left out 
in Chapter [T] Section 1.2.1 This appendix (in contrast with Appendix [A|) 
contains original material in the simple case discussed on Section [Bj Let us 
recall the setting from that section. 

Given two positive numbers X± > 0, consider the diffusion = 

dX,{t) = diag(A+, -X^)X,{t)dt + edW{t). 

Let 6 > and D = (-6,6) x {-6, 6) cM?. We study the exit problem of 
from D. We start the diffusion X^ inside D: X^^O) = (0, xo) G D. 

we used Ito's formula in each coordinate 



Recall that in Section 1.2.1 



to write Duhamel principle for xl and x^. Here we rewrite identities (1.8) 



and (1.9) for easier reference: 
xlit) 

xl{t) = e""-''xo + e 



Jo 



''^dW2{t 



(B.l) 
(B.2) 



Let N{t) denote the stochastic integral in (B.l) 
Recall that rf is defined as 



rf =inf{t>0: \xl{t)\ > 5] . 

First we prove that is finite with probability 1. This is a general fact that 
can be found in the literature, for example in [SI Proposition 1.8.2], but we 
chose to prove it directly from Duhamel principle. We do this in order to 
stress the importance of such a representation in our setting. Without any 
further discussion, we go into the results. 
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Lemma 65 For every 6 > and e > 0, < co P—a.s. 



Proof. Let n G N, it is enough to show that P{t^ > n} — )• as n — )• oo. 



Observe that (B.l) imphes that 



P{rf >n} = p\ sup ee^+^\Af{t)\ < 5 \ 
[te[o,n] J 

<pj sup ee^+^\M{t)\ < 6} 
<pLeAW2 sup |AA(t)|<di. 

[ te[n/2,n] J 

Here the last two inequaUties follow from the properties of the supremum and 
the exponential function respectively. Take no G N such that e~^5 < e^+^/^, 
for every n > uq. Then, for every n > no, 

P J ee^+'^Z^ g^^p |_^(^)| < 5 I < p J |_^(^^| < |_^(^)| > ^ 

ie[n/2,n] J [ te[n/2,n] ie[n/2,n] 

^pl A+n/4 gup |_^(i)|<ll 
[ te[n/2,n] J 

<pLA+n/4 sup |AA(t)|<ll. 
( te[n/2,n] J 

The proof will be finished as soon as we can show that the last probability 
converges to 0. To see this, note that, for every t > 0, the random variable 
Af{t) is a zero mean gaussian random variable with variance Jq e~'^^+^ds = 

— 2A • Denote 



]^ g— 2A+n ■ 

The result follows since, — )• 0, as n — )• oo, and 

P i e^+"/^ sup \M{t)\ < 1 i < P |e^+"/^|AA(n)| < l| 

[ te[n/2,n] J J 
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Lemma 66 For any 6 > 0, 



rf A CO, 



as e — )■ 0. 

Proof. Let 5 > 0. It is enough to prove that P{rf > T} — ^ 0, e — ^ 0, for 
any T > 0. Use Duhamel principle (B.l) to get 

P{rf>r} = pJ sup \xl{t)\>5\ 

[t<TAT^ I 

< P I ee^+^ sup 

The last inequality, Chebyshev inequality [211 Section 3.2], BDG inequal- 
ity Proposition 3.3.28] and Ito isometry pH", Proposition 2.10] imply 
that for some constant Ci, 











Jo 





P{t° >T} < Cie^^+^ e^E 



TAr° 



-2X+S 



dW{s) 



This proves our result. ■ 

The last technical step in this appendix is about the convergence of the 
random variable Af^. Recall that Af^ = AA(t^^) and 



'dW{s). 



Lemma 67 As e ^ 0, Afe ^ Af in probability. 



Proof. The lemma is a consequence of Ito isometry and Lemma 66 Let 
7 > and T-^ = -(2A+)-Mog(7A+) > 0. Due to Lemma 66 we can find 
eo > such that 

P{rf > T^} < 7A+, (B.3) 



for every e G (0,eo). Use Ito isometry and (B.3) to obtain 



E|A4-AA|2 = E / 



e-^^+'ds 



< 



2A. 



2A. 



-P{rf < T^} 



<7, 
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for every e G (0, cq). The result follows since 7 > is arbitrary and 
convergence implies convergence in probability. ■ 
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